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RECOMBINASE-BASED METHODS FOR PRODUaNG EXPRESSION 
VECTORS AND COMPOSITIONS FOR USE IN PRACTIONG THE SAME 

10 



viM nf the Invention 

The field of this invention is molecular biology, particularly recombinant DNA 

15 engineering. 

Rar.kp ;rnund of the Invention 

The processes of isolating, cloning and expressing genes are central to the field 
of molecular biology and play prominent roles in research and industry in 
biotechnology and related fields. Until recently, the isolation and cloning of geneshas 
been achieved in vitro using restriction endonucleases and DNA ligases. Restriction 
endonucleases are enzymes which recognize and cleave double-stranded DNA at a 
specific nucleotide sequence, and DNA hgases are enzymes which join fragments of 
DNA together via the phosphodiester bond. A DNA sequence of interest can be "cut" 
or digested into manageable pieces using a restriction endonuclease and then inserted 
into an appropriate vector for cloning using DNA hgase. However, in order to transfer 
the DNA of interest into a different vector-most often a specialized expression vector- 
-restriction enzymes must be used again to excise the DNA of interest from the 
cloning vector, and then DNA ligase is used again to ligate the DNA of interest into 

the chosen expression vector. 

The ability to transfer a DNA of interest to an appropriate expression vector is 
often limited by the availability or suitability of restriction enzyme recognition sites. 
Often multiple restriction enzymes must be employed to remove the desired codmg 
region Further, the reaction conditions used for each enzyme may differ such that it is 
necessary to perform the excision reaction in separate steps, or it may be necessary to 
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^„,e a partcular ^zyme used m an inWa, resmction enzyme ■»» » 

con,pWng subsequent resricion enzyme diges^ons due .0 buffer a«l/or cofa«ur 

i„eo.upa.ib,»y. Many of .hese extra sups require ,in,e-co„sun,b,g purificadon of .he 

subcloning intermediate. 
5 TT^ere is. therefore, a need to develop protocols and compositions for the rapid 

transfer of a DNA molecule of interest from one vector to another in vitro or in nvo 

without the need to rely upon restriction enzyme digestions. 

p,.i»vantT.iterature 

U S Patents of interest include: U.S. Patent Nos. 5.527,695; 5,744,336; 
10 5.851.808;5.888.732;and5.962.255.AlsoofinterestisLi«etal.,CurrentBiology 

(1998)8:1300-1309. 

^TT^T^^^PY invention 

Methods are provided for producing an expression vector. In the subject 
„.ethods, donor and acceptor vectors are combined in the presence of a recombmase to 
produce an expression vector that includes a first and second recombinase recogmt,on 
site oriented in the same direction, wherein said first and said second recombmase 
recognition sites are capableofrecombiningwith each other.ln the subject methods. 

one of the donor and acceptor vectors includes a single recombinase recogmtion site 
20 while the other includes two recombinase recognition sites. Also provided are 
compositions for use in practicing the subject methods, including the donor and 
acceptor vectors themselves, as well as systems and kits that include the same. T^c 
subject invention finds use in a variety of different applications. 



15 



25 



30 



RRTFF DESCPTPTT^^ FIGURES 
Figure 1 A provides a schematic representation of a preferred embodiment of 

the subject methods. Figure IB provides the reading firamespDNR-1 topDNR-3 

vectors depicted in Figs. 2A to 2C, respectively. 

Figures 2A to 2D provide schematic representations of four different donor 

plasmid vectors. i.e., pDNR-1; pDNR-2; and pDNR-3; pDNR-Lib according to a 

preferred embodiment of the subject invention. 

Figures 3A to 3 J provide schematic representations of 1 5 different acceptor 

plasmids. i.e.. pLP-GADT7; pLP-GBKT7; pIP-EGFP-Cl ; pI^-ECFP-Cl ; pLP- 

EYFP-Cl ; pLP-IRESneo; pLP-IRES2-EGFP; pLP-TRE2; pLP-RevTRE; and pLP- 



2 



PCTAJSOO/19221 

WO 01/05961 

LNCX suitable for use with the donor plasmids pDNR-1; pDNR-2 andpDNR-3. Other 
specific acceptor vectors of interest are pLP-ProTet; pLP-CMV-Myc; pLP-CMV-HA; 
pLP-Shuttle; pLP-AdenoX. as described more folly in the specification. 

J nF.F]NlTIONS 

As used herein, the term "donor construct" refers to a donor vector, i.e., a 
donor nucleic acid construct comprising two donor sequence-specific recombinase 
target sites each having a defined 5' to 3' orientation and placed in the donor construct 
such that they have the same 5' to 3' orientation, and a unique restriction enzyme site 
10 or polylinker. wherein the restriction enzyme site or polylinker is located 3' of the 
first-donor sequence-specific recombinase target site and 5' of the second-donor 
sequence-specific recombinase target site, and wherein the recombinase recogmUon 
sites are capable of recombining with each other. 

As used herein, the term "first donor fragment" or "desired donor fragment" 
, 5 refers to the fragment produced when the donor construct is resolved, comprising a 
single sequence-specific recombinase target site having a 5' to 3' orientation wherem 
the 5' half of the single sequence-specific recombinase target site is derived from the 
5' half of the second-donor sequence-specific recombinase target site in the donor 
construct and the 3' half of the single sequence-specific recombinase target site is 
20 derived fix,m the 3' half of the first-donor sequence-specific recombinase target site of 
the donor construct, a polylinker or unique restriction site 3' to said sequence-specific 
recombinase target site, and the donor-partial selectable marker, or in certain 
embodiments, a donor-fimctional selectable marker. It is the first donor fragment that 
^11 combine with the acceptor construct to produce the final desired recombination 
25 product. 

As used herein the term "second donor fragment" or "non-desired donor 
fragment" refers to the fragment produced when the donor construct is resolved, 
comprising a single sequence-specific recombination target site in which the 5' half is 
derived from the 5' half of the first-donor sequence-specific recombinase target site 
from the donor construct and the 3' half is derived from the 3' half of the second- 
donor sequence-specific recombinase target site from the donor construct. 

As used herein, the term "acceptor construct" refers to an acceptor nucleic acid 
construct comprising at least one origin of replication, an acceptor sequence-specific 
recombinase target site having a defined 5» to 3' orientation, a first promoter located at 
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the 5- end of the acceptor sequence-specific recombinase target site, and in certain 
embodiments, an acceptor-partial selectable marker. 

As used herein, "final recombination constructs" refers to the recombination 
products produced when either the first donor fragment or the second donor fi-agment 
recombines with an acceptor construct, i.e., to generate expression vectors produced 

by the subject methods. 

As used herein, "final desired recombination construct" refers to a 
recombination product produced when the first, or desired, donor firagment recombines 
with an acceptor construct, i.e., an expression construct. 

The teims "sequence-specific recombinase" and "site-specific recombinase" 
refer to enzymes or recombinases that recognize and bind to a short nucleic acid site or 
"sequence-specific recombinase target site", i.e., a recombinase recognition site, and 
catalyze the recombination of nucleic acid in relation to these sites. These enzymes 
include recombinases, transposases and integrases. 
1 5 The terms "sequence-specific recombinase target site", "site-specific 

recombinase target site", "sequence-specific target site" and "site-specific target site" 
refer to short nucleic acid sites or sequences, i.e., recombinase recognition sites, which 
are recognized by a sequence- or site-specific recombinase and which become die 
crossover regions during a site-specific recombination event. Examples of sequence- 
20 specific recombinase target sites include, but are not limited to, lox sites, att sites, dif 
sites and fit sites. 

The term "lox site" as used herein refers to a nucleotide sequence at which the 
product of the ere gene of bacteriophage PI, the Cre recombinase, can catalyze a site- 
specific recombination event. A variety of lox sites are known in the art, including the 
25 naturally occurring loxP, loxB, loxL and loxR. as well as a number of mutant, or 
variant, lox sites, such as loxPSl 1, loxP514, loxA86, loxAl 17, loxC2, loxP2, loxP3 
and lox P23. 

The term "frt site" as used herein refers to a nucleotide sequence at which the 
product of the FLP gene of the yeast 2 micron plasmid, FLP recombinase, can catalyze 
30 site-specific recombination. 

The term "unique restriction enzyme site" indicates that the recognition 
sequence of a given restriction enzyme appears once within a nucleic acid molecule. 

A restriction enzyme site or restriction site is said to be located "adjacent to the 
3' end of a sequence-specific recombinase target site" if the restriction enzyme 
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recognition site is located downstream of the 3' end of the sequence-specific 
recombinase target site. The adjacent restriction enzyme site may, but need not, be 
contiguous with the last or 3' most nucleotide comprising the sequence-specific 
recombinase target site. 
5 The terms "polylinker" or "multiple cloning site" refer to a cluster of 

restriction enzyme sites, typically unique sites, on a nucleic acid construct that can be 
utilized for the insertion and/or excision of nucleic acid sequences, such as the coding 
region of a gene, ioxP sites, etc. 

The term "temiination sequence" refers to a nucleic acid sequence which is 
10 recognized by the polymerase of a host cell and results in the tennination of 
transcription. Prokaryotic termination sequences commonly comprise a GC-rich 
region that has a two-fold symmetry followed by an AT-rich sequence. A commonly 
used termination sequence is the T7 termination sequence. A variety of termination 
sequences are known in the art and may be employed in the nucleic acid constructs of 
15 the present invention, including the TINT3, TL13, TL2, TRl, TR2, and T6S 

termination signals derived from the bacteriophage lambda, and termination signals 
derived from bacterial genes, such as the trp gene of £. coli. 

The terms "polyadenylation sequence" (also referred to as a "poly A* site" or 
••poly A* sequence") as used herein denotes a DNA sequence which directs both the 
20 termination and polyadenylation of the nascent RNA transcript. Efficient 

polyadenylation of the recombinant transcript is desirable, as transcripts lacking a poly 
A* tail are typically unstable and rapidly degraded. The poly A" signal utilized in an 
expression vector may be "heterologous" or "endogenous". An endogenous poly A^ 
signal is one that is found naturally at the 3' end of the coding region of a given gene 
25 in the genome. A heterologous poly A^ signal is one which is isolated from one gene 
and placed 3' of another gene. e.g.. coding sequence for a protein. A commonly used 
heterologous poly A* signal is the SV40 poly A" signal. The SV40 poly A^ signal is 
contained on a 237 bp BammiBca restriction fragment and directs both termination 
and polyadenylation; numerous vectors contain the SV40 poly A* signal. Another 
30 commonly used heterologous poly A" signal is derived from the bovine growth 
hormone (BGH) gene; the BOH poly A"^ signal is also available on a number of 
commercially available vectors. The poly A^ signal from the Herpes simplex virus 
thymidine kinase (HSV tk) gene is also used as a poly A* signal on a number of 
conunercial expression vectors. 
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As used herein, the terms "selectable marker" or "selectable marker gene" refer 
to a gene which encodes an enzymatic activity and confers the ability to grow m 
medium lacking what would othemise be an essential nutrient; in addition, a 
selectable marker may confer upon the cell in which the selectable marker is 

5 expressed, resistance to an antibiotic or drug. A selectable marker may be used to 
confer a particular phenotype upon a host cell. When a host cell must express a 
selectable marker to grow in selective medium, the marker is said to be a positive 
selectable marker (e.g.. antibiotic resistance genes which confer the ability to grow m 
the pr^ence of the appropriate antibiotic). Selectable markers can also be used to 

10 select against host cells containing a particular gene; selectable markers used m this 
manner are referred to as negative selectable markers. 

AS used herein, the term "donor-partial selectable marker" found in certain 
embodiments of the subject invention refers to portion of a selectable marker gene 
encoded by the donor construct which is non-functional byitself, by which IS meant 

15 that it must be positioned on a vector in operable relation with another element m 
Older to be expressed. Examples of donor-partial selectable markers are coding 
sequences and promoter regions of complete selectable markers of fiinctiomng 

expression modules or cassettes. 

As used herein, the term "donor-functional selectable marker" found in certain 
20 embodiments of the subject invention refers to a functional selectable marker gene 

encoded by the donor construct. 

As used herein, the term "acceptor-partial selectable marker" found in certam 
embodiments of the subject invention refers to a portion of a selectable marker gene 
encoded by the acceptor construct which is non-functional by itself, as described 
25 above, e.g., a coding sequence or promoter by itself 

As used herein, the term "acceptor-functional selectable marker" found m 
certain embodiments of the subject invention refers to a functional selectable marker 
gene encoded by the acceptor construct. 

As used herein, the term "recombinant-functional selectable marker" refers to 
30 the functional selectable marker genecr^ted upon recombination bet^veen the donor 
construct and the acceptor constnict which results in the adjacent placement of the 
donor-partial selectable marker and the acceptor-partial selectable marker, i.e.. 
flanking either side of a recombinase site. 
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AS used herein, the term "construct" is used in reference to nucleic acid 
molecules that transfer DNA segment(s) fiom one cell to another. The term "vector- 
is sometimes used interchangeably with "constmct". The term "construct" mcludes 
circular nucleic acid constructs such as plasmid constructs, phagemid constructs, 
5 cosmid vectors, etc., as well as Hnear nucleic acid constructs including, but not limited 
to PGR products. The nucleic acid constmct may comprise expression signals such as 
apromoter and/or an enhancer in operable linkage, and then is generally referred to as 
an "expression vector" or "expression construct". 

The term "expression construct" as used herein refers to an expression module 
10 or expression cassette made up of a recombinant DNA molecule containing a desired 
coding sequence and appropriate nucleic acid sequences necessary for the expression 
of the operably linked coding sequence in a particular host organism. Nucleic ac,d 
sequences necessary for expression in prokaryotes usually include a promoter and a 
ribosome binding site, often along with other sequences. Eukaryotic cells are known 
15 to utilize promoters, enhancers, and termination and polyadenylation signals. 

The terms "in operable combination", "in operable order" and "operably 
Uriced" as used herein refer to the linkage of nucleic acid sequences in such a manner 
that a nucleic acid molecule capable of directing the transcription of a given gene 
and/or the synthesis of a desired protein molecule is produced. The terms also refer to 
20 the linkage of amino acid sequences in such a mamier so that the reading frame is 
maintained and a functional protein is produced. 

A cell has been "transformed" or "transfected" with exogenous or heterologous 
DNA when such DNA has been introduced inside the cell. The transforming DNA 
may or may not be integrated (covalently linked) into the genome of the cell. In 
25 prokaryotes. yeast, and mammalian cells for example, the transforming DNA may be 
maintained on an episomal element such as a vector or plasmid. With respect to 
eukaryotic cells, a stably transfonned cell is one in which the transforming DNA is 
inherited by daughter cells through chromosome replication. This stability is 
demonstrated by the abiHty of the eukaryotic cell to establish cell lines or clones 
30 comprised of a population of daughter cells containing the transforming DNA. A 
"clone" is a population of cells derived from a single cell or ancestor by mitosis. A 
"cell line" is a clone of a primary cell that is capable of stable growth in vitro for many 
generations. An organism, such as a plant or animal, that has been transformed with 
exogenous DNA is termed "transgenic". 
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Transfomiation of prokaryotic cells may be accomplished by a variety of 
means known in the art. including the treatment of host cells with CaCb to make 
competent cells, electroporation, etc. Transfection of eukaryotic cells may be 
accomplished by a variety of means known in the art, including calcium phosphate- 
DNA co-precipitation, DEAE-dextran-mediated transfection, polybrene-mediated 
transfection, electroporation. microinjection, liposome fusion, lipofection, protoplast 
fusion, retroviral infection, and biolistics. 

As used herein, the term "host" is meant to include not only prokaryotes. but 
also eukaryotes. such as yeast, plant and animal cells. A recombinant DNA molecule 
or gene can be used to transform a host using any of the techniques commonly known 
to those of ordinary skill in the art. Prokaryotic hosts may include E. coli, S. 
tymphimurium, Serraiia marcescens and Bacillus subtilis. Eukaryotic hosts include 
yeasts such as Saccharomyces ceremiae, Schizosaccharomyces pombe, Pichia 
pastoris, mammalian cells and insect cells, and. plant cells, such as Arabidopsis 
thaliana and Tobaccum nicotiam. 

As used herein, the terms "restriction endonucleases" and "restriction 
enzymes" refer to bacterial enzymes, each of which cut double-stranded DNA at or 
near a specific nucleotide sequence. 

"Recombinant DNA technology" refers to techniques for uniting two 
heterologous DNA molecules, usually as a result of in vitro ligation of DNAs from 
different organisms. Recombinant DNA molecules are commonly produced by 
experiments in genetic engineering. Synonymous tenns include "gene splicing", 
'•molecular cloning" and "genetic engineering" The product of these manipulations 
results in a "recombinant" or "recombinant molecule". The term "recombinant 
protein" or "recombinant polypeptide" as used herein refers to a protein molecule 
which is expressed from a recombinant DNA molecule. 

The ribose sugar is a polar molecule, and therefore, DNA is referred to as 
having a 5' to 3', or 5' to 3', directionality. DNA is said to have "5' ends" and "3' 
ends" because mononucleotides are reacted to make oligonucleotides in a mamier such 
that the 5' phosphate of one mononucleotide pentose ring is attached to the 3' oxygen 
of its neighbor via a phosphodiester linkage. TTierefore. an end of an oligonucleotide 
is referred to as the «5' end" if its 5' phosphate is not linked to the 3' oxygen of a 
mononucleotide pentose ring and as the «3' end" if its 3' oxygen is not linked to a 5' 
phosphate of a subsequent mononucleotide pentose ring. As used herein, a nucleic 
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acid sequence, even if internal to a larger oligonucleotide, also has a 5' to 3' 
orientation. In either a linear or circular DNA molecule, discrete elements are referred 
to as being "upstream" or "5"' of the "downstream" or "3"' elements. This 
terminology reflects the fact that DNA has an inherent 5' to 3' polarity, and 
5 transcription typically proceeds in a 5' to 3' fashion along the DNA strand. The 
promoter and enhancer elements which direct transcription of an operably Unked 
coding region, or open reading frame, are generally located 5'. or upstream, of the 
coding region. However, enhancer elements can exert their effect even when located 
3' of the promoter and coding region. Transcription termination and polyadenylation 
10 signals are typically located 3' or downstream of the coding region. 

The 3' end of a promoter is said to be located upstream of the 5' end of a 
sequence-specific recombinase target site when, moving in a 5' to 3' direction along 
the nucleic acid molecule, the 3' terminus of a promoter precedes the 5' end of the 
sequence-specific recombinase target site. When the acceptor construct is intended to 
15 permit the expression of a translation fusion, the 3' end of the promoter is located 
upstream of both the sequences encoding the amino-terminus of a fusion protein and 
the 5' end of the sequence-specific recombinase target site. Thus, the sequence- 
specific recombinase target site is located within the coding region of the fusion 
protein (i.e. located downstream of both the promoter and the sequences encoding the 

20 affinity domain, such as Gst). 

As used herein, the term "adjacent", in the context of positioning of genetic 
elements in the constructs, shall mean within about 0 to 2500, sometimes 0 to 1000 bp 
and sometimes within about 0 to 500, 0 to 400, 0 to 300 or 0 to 200 bp. 

A DNA "coding sequence" is a double-stranded DNA sequence which is 

25 transcribed and translated into a polypeptide in vivo when placed under the control of 
appropriate regulatory sequences. The boundaries of the coding sequence are 
detennined by a start codon at the 5' (amino) terminus and a translation stop codon at 
the 3' (carboxyl) terminus. A coding sequence can include, but is not limited to, 
prokaryotic sequences, cDNA from eukaryotic mRNA. genomic DNA sequences bom 

30 eukaryotic (e.g.. mammalian) DNA, and even synthetic DNA sequences. A 

polyadenylation signal and transcription termination sequence will usually be located 
3- to the coding sequence. A "cDNA" is defmed as copy-DNA or complementary- 
DNA, and is a product of a reverse transcription reaction fi^m an mRNA transcript. 
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An "exon" is an expressed sequence transcribed from the gene locus, whereas an 
"intron" is a non-expressed sequence that is from the gene locus. 

Transcriptional and translational control sequences are DNA regulatory 
sequences, such as promoters, enhancers, polyadenylation signals, terminators, and the 

5 like, that provide for the expression of a coding sequence in a host cell. A "cis- 

element" is a nucleotide sequence, also tenned a "consensus sequence" or "motif', that 
interacts with proteins that can upregulate or downregulate expression of a specific 
gene locus. A "signal sequence" can also be included with the coding sequence. This 
sequence encodes a signal peptide, N-terminal to the polypeptide, that communicates 

10 to the host cell and directs the polypeptide to the appropriate cellular location. Signal 
sequences can be found associated with a variety of proteins native to prokaryotes and 
eukaryotes. 

A "promoter sequence" is a DNA regulatory region capable of binding KNA 
polymerase in a cell and initiating transcription of a downstream (3" direction) coding 
15 sequence. For purposes of defining the present invention, the promoter sequence 

includes, at its 3' terminus, the transcription initiation site and extends upstream (in the 
5- direction) to include the minimum number of bases or elements necessary to initiate 
transcription at levels detectable above background. Within the promoter sequence 
will be found a transcription initiation site, as well as protein binding domains 
20 (consensus sequences) responsible for the binding of RNA polymerase. Eukaryotic 
promoters often, but not always, contain "TATA" boxes and "CAT" boxes. 

Efficient expression of recombinant DNA sequences in eukaryotic cells 
requires expression of signals directing the efficient temiination and polyadenylation 
of the resulting transcript. Transcription termination signals are generally found 
25 downstream of the polyadenylation signal and are a few hundred nucleotides in length. 
As used herein, "an origin of rephcation" or "origin" refers to any sequence 
capable of directing replication of a DNA constract in a suitable prokaryotic or 
eukaryotic host (e.g.. the ColEl origin and its derivatives; the yeast 2 n origin). 
Eukaryotic expression vectors may also contain "viral replicons" or "origins of 
30 replication". Viral replicons are viral DNA sequences which allow for the 

extrachromosomal replication of a vector in a host cell expressing the appropriate 
replication factors. Vectors which contain either the SV40 or polyoma virus origin of 
replication replicate to high copy number (up to 10* copies/cell) in cells that express 
the appropriate viral T antigen. Vectors which contain the replicons fi-om bovine 
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papillomavirus or Epstein-Barr virus replicate extrachromosomally at low copy 

number (-100 copies/cell). 

As used herein, the terms "nucleic acid molecule encoding", "DNA sequence 
encoding", and "DNA encoding" refer to the order or sequence of 
5 deoxyribonucleotides along a strand of deoxyribonucleic acid. The order of these 
deoxyribonucleotides determines the order of amino acids along the polypeptide 
(protein) chain. The DNA sequence thus codes for the amino acid sequence. 

As used herein, the term "gene" means the deoxyribonucleotide sequences 
comprising the coding region of a structural gene, i.e., the coding sequence for a 
10 protein or polypeptide of interest, including sequences located adjacent to the coding 
region on both the 5' and 3' ends for a distance of about 1 kb on either end, such that 
the gene corresponds to the length of the full-length mRNA. The sequences which are 
located 5' of the coding region and which are present on the mRNA are referred to as 
5' non-translated sequences. The sequences which are located 3' or downstream of 
15 the coding region and which are present on the mRNA are referred to as 3' non- 
translated sequences. The term "gene" encompasses both cDNA and genomic forms 
of a gene. A genomic form or clone of a gene contains the coding region interrupted 
with non-coding sequences termed "introns" or "intervening regions" or "intervening 
sequences". Introns are segments of a gene which are transcribed into heteronuclear 
20 RNA (hnRNA); introns may contain regulatory elements such as enhancers, hitrons 
are removed or "spliced out" from the nuclear or primary transcript; introns therefore 
are absent in the mature messenger RNA (mRNA) transcript. The mRNA functions 
during translation to specify the sequence or order of amino acids in a nascent 
polypeptide. 

25 In addition to containing introns. genomic forms of a gene may also include 

sequences located on both the 5' and 3' end of the sequences which are present on the 
RNA transcript. These sequences are referred to as "flanking" sequences or regions 
(these flanking sequences are located 5' or 3' to the non-translated sequences present 
on the mRNA transcript). The 5' flanking region may contain regulatory sequences 

30 such aspromoters and enhancers which control or influence the transcription of the 
gene. The 3' flanking region may contain sequences which direct the termination of 
transcription, post-transcriptional cleavage and polyadenylation. 

As used herein, the term "purified" or "to purify" refers to the removal of 
contaminants from a sample. For example, recombinant Cre polypeptides are 
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expressed in bacterial host cells {e.g.. as a GST-Cre or (HN)6-Cre fusion protein) and 
the Cre polypeptides are purified by the removal of host cell proteins; the percent of 
recombinant Cre polypeptides is thereby enriched or increased in the sample. 

As used herein the term "portion" refers to a fraction of a sequence, gene or 
protein. "Portion" may comprise a fraction greater than half of the sequence, gene or 
protein, equal to half of the sequence, gene or protein or less than half of the sequence, 
gene or protein. Typically as used herein, two or more "portions" combine to 
comprise a whole sequence, gene or protein. 

As used herein, the term "fusion protein" refers to a chimeric protein 
containing a protein of interest joined to an exogenous protein fragment. The fusion 
partner may enhance solubility of the protein of interest as expressed in a host cell, 
may provide an affinity tag to allow purification of the recombinant fusion protein 
from the host cell or culture supernatant, or both. If desired, the fiision protein may be 
removed from the protein of interest by a variety of enzymatic or chemical means 
known to the art. 

nRRCRIPTION OF THE SPEr iFir FMRODIMENTS 
Methods are provided for producing an expression vector, hi the subject 
methods, donor and acceptor vectors are combined in the presence of a recombinase to 
produce an expression vector that includes a first and second recombinase recognition 
site oriented in the same direction. . In the subject methods, one of the donor and 
acceptor vectors includes a single recombinase recognition site while the other 
includes two recombinase recognition sites. Also provided are compositions for use in 
practicing the subject methods, including the donor and acceptor vectors themselves, 
as well as systems and kits that include the same. The subject invention finds use in a 
variety of different applications. 

Before the subject invention is further described, it is to be understood 
that the invention is not liniited to the particular embodiments of the 
invention described below, as variations of the particular embodiments may 
be made and stiU fall v^dthin the scope of the appended claims. It is also to be 
understood that the terminology employed is for the purpose of describing 
particular embodiments, and is not intended to be limiting. Instead, the scope 
of the present invention will be established by the appended claims. 
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In this specification and the appended claims, the singular forms "a," 
"an," and "the" indude plural reference unless the context clearly dictates 
otherwise. Unless defined otherwise, all technical and scientific terms used 
herein have the same meaning as commonly understood to one of ordinary 
skill in the art to which this invention belongs. 



Methods 



As summarized above, the subject invention provides recombinase-based 
methods for producing expression vectors. More specifically, the subject invention 
provides methods for producing expression vectors by combining a donor and acceptor 
vector that each include one or more recombinase recognition sites with a recombinase 
under conditions sufficient for recombinase mediated site specific recombination to 
occur, where such recombination results in the production of an expression vector that 
lacks at least a portion of the donor or acceptor vector from which it is produced, i.e., 
to produce a non-fusion expression vector. 

A feature of the subject invention is that the donor and acceptor vectors must 
be able to recombine in the presence of a suitable recombinase to produce an 
expression vector as described above, where the expression vector lacks at least a 
portion of the initial donor or acceptor vector, i.e., it is a non-fusion expression vector. 
As such, the donor and acceptor vectors must be able to participate in a recombination 
event that is other than a fusion event, where by fusion event is meant an event in 
which two complete vectors are fused in their entirety into one fused vector, e.g., 
where two plasmids are fused together to produce one plasmid that includes all of 
material from the initial two plasmids, i.e., a fusion plasmid. As such, the subject 
methods are not fusion methods, where such methods are defined as those methods in 
which a single vector is produced from two or more initial vectors in their entirety, 
such that all of the initial vector material of each parent vector, e.g.. plasmid, is present 
in its entirety in the resultant fusion vector. 

The donor and acceptor vectors are further characterized in that one of the 
donor and acceptor vectors includes only one recombinase recognition site, while the 
other of the donor and acceptor vectors includes two recombinase recognition sites. In 
a first preferred embodiment, the donor vector includes two recombinase recognition 
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Sites while the acceptor vector includes a single recombinase recognition site. In an 
alternative embodiment, the donor vector includes a single recombinase recognition 
site while the acceptor vector includes two recombinase recognition sites. The donor 
and acceptor vectors of this first, preferred embodiment and this second, alternative 
embodiment, are described in greater detail below. 

The donor and acceptor vectors described generally above may be linear or 
circular, e.g.. plasmids, and in many embodiments of the subject invention are 
plasmids. Where the donor and acceptor vectors are plasmids, the donor and acceptor 
vectors typically range in length from about.2 kb to 200 kb. usually from about 2 kb 
to 40 kb and more usually from about 2 kb to 10 kb. 

The donor and acceptor vectors are further characterized in that all ofthe 
recombinase recognition sites on the donor and acceptor vectors must be recognized 
by the same recombinase and should be able to recombine with each other, but within 
this parameter they may be the same or different, but in many embodiments are 
usually the same. Recombinase recognition sites, i.e., sequence-specific recombinase 
target sites, of interest include: Cre recombinase activity recognized sites, e.g.. loxP, 
loxP2. loxPSll. 10XP514. loxB. loxC2. loxL, loxR, loxA86, loxAlH; att, dif; frt; and 
the like. The particular recombinase recognition site is chosen, at least in part, based 
on the nature ofthe recombinase to be employed in the subject methods. 

The Donor Vector 

As mentioned above, in a preferred embodiment ofthe subject methods, the 
donor vector includes two recombinase recognition sites while the acceptor vector 
includes a single recombinase recognition site. In the donor vector of these 
embodiments, the donor vector includes two recombinase recognition sites, capable of 
recombining with each other, c.g.. site 1 A and site IB. that flank or border a first or 
donor domain, i.e., desired donor fragment, where this domain is the portion ofthe 
vector that becomes part of the expression vector produced by the subject methods, 
nielengthofthedonordoraainmay vary, but in many embodiments ranges from 1 kb 

to 200 kb. usually from about 1 kb to 10 kb. The portion ofthe donor vector that is not 
part of this donor domain, i.e.. the part that is 5' of site lA and 3' of site IB. is refenred 
to herein for clarity as the non-donor domain of the donor vector. 
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The two recombinase recognition sites of the donor vector are characterized in 
that they are oriented in the same direction and are capable of recombirung with each 
other By oriented in the same direction it is meant that they have the same head to taal 
orientation. Thus, the orientation of she 1 A is the same as the orientation of sUe IB. 

5 The donor domain flanked by the two recombinase recognition sites, i.e.. the 

portion of the vector 3' of the first recombinase site 1 A and 5' of the second 
recombinase site IB. includes at least the following components: (a) at least one 
restriction site and (b) at least a portion of a selectable marker. e.g. a coding sequence, 
apromoter, or a complete selectable marker made up of a coding sequence and a 

10 promoter.Thedonordomainmayincludeatleastonerestrictionsiteorapluralityof 

distinct restriction sites, e.g., as found in a multiple cloning site or polylinker, where 
by restriction site is meant a stretch of nucleotides that has a sequence that .s 
recognized and cleaved by a restriction endonuclease. Where a plurality of restriction 
sites are present in the donor domain, the number of distinct or different restriction 

15 sites typically ranges from about 2 to 5, usually from about 2 to 13. 

In many embodiments, there are at least two restriction sites, which may or 
rr^y not be identical depending on the particular protocol employed to produce the 
donor plasmid. that flank a nucleic acid which is a coding sequence for a protem of 
interest, where the protein of interest may or may not be known, e.g., it may be a 

20 knowncodingsequenceforaknownproteinorpolypeptideoracodingsequencefor 

an as yet unidentified protein or polypeptide, such as where this nucleic acid of interest 
is a constituent of a library, as discussed in greater detail below. The length of this 
nucleic acid of interest nucleic acid may vary greatly, but generally ranges from about 
18 bp to 20 kb. usually from about 100 bp to 10 kb and more usually from about 1 kb 
25 to 3 kb. At least one restriction site and this nucleic acid of interest nucleic acid, when 
present, are sufficiently close to the 3' end of the first flanking recombinase site, i.e.. 
recombinase recognition site 1 A. such that in the expression vector produced from the 
donor plasmid. expression of the coding sequence of the nucleic acid of interest is 
drivenby apromoter positioned 5' of this first recombinase site. As such, the distance 

30 separating this restriction site/nucleic acid of interest nucleic acid from the 

recombinase site typically ranges from aboutlbp to 150 bp. usually from aboutlbp 

to 50 bp. „ . , ^ 

ta a first preferred embodiment, the donor domain also generally mcludes a 
portion of a selectable marker. By portion of a selectable marker is meant a sub-part of 
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a selectable marker, e.g. a coding sequence or a promoter, which can be joined with a 
second subpart to produce a functioning selectable marker that confers some selectable 
phenotype on the host cell in which the expression vector produced by the subject 
methods is to be propogated. Examples of subparts of selectable markers are coding 
sequences and promoters. As such, in many embodiments, the portion of the selectable 
marker present on the donor domain is a coding sequence of a marker gene or a 
promoter capable of driving expression of the coding sequence of the marker gene, 
where in certain prefcired embodiments, the coding sequence of a mailcer gene is the 
portion of the selectable marker present on the donor domain. Examples of coding 
sequences of interest include, but are not limited to, the coding sequences ftom the 
following marker genes:_the chloramphenicol resistance gene, the ampicilUn 
resistance gene, the tetracycline resistance gene, the kanamycin resistance gene, the 
streptomycin resistance gene and the SacB gene from B. subtilis encoding sucrase and 
conferring sucrose sensitivity, and the like. The promoter portions or sub-parts of this 
selectable marker are any convenient promoters capable of driving expression of the 
selectable marker in the expression vector produced by the subject methods, see infra, 
and in many embodiments are bacterial promoters, where particular promoters of 
interest include, but are not limited to: the AmpicilUn resistance promoter, the 
inducible lac promoter, the tet-inducible promoter from pProTet (Piteto-i)- available 
from CLONTECH, T7, T3, and SP6 promoters; and the like. The distance of this sub- 
part or portion of the selectable marker fitJm the 3' end of the second recombinase 
recognition site, i.e., site IB, is sufficient to provide for expression of the marker to 
occur in the final expression vector, where the other part of selectable marker that is 
required for efficient expression of the selectable marker is present on the other side, 
i.e., the 5' side of the adjacent recombinase recognition site. This distance typically 
ranges from about 1 bp to 2.5 kb, usually from about 1 bp to 500 bp. 

The length of the donor domain flanked by the first and second recombinase 
sites of the donor plasmid, i.e., the length of the desired donor fragment, may vary 
greatly, so long as the above described components are present on the donor domain. 
Generally, the length is at least about 100 bp. usually at least about 500 bp and more 
usually at least about 900 bp, where the length may be as great as 100 kb or greater, 
but generally does not exceed about 20 kb and usually does not exceed about 10 kb. 
Typically, the length of the donor domain ranges from about 100 bp to 100 kb, usually 
from about 500 bp to 20 kb and more usually from about 900 bp to 10 kb. 
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In addition to the above described components, the donor vector may include a 
number of additional elements, where desired, that are present on the non-donor 
domain or non-desired donor fragment of the donor vector. For example, the non- 
donor domain generally includes an origin of rephcation. This origin of replication 
may be any convenient origin of replication or ori site, where a number of on sites are 
known in the art. where particular sites of interest include, but are not limited to: 
ColEl and its derivatives, pMBl . other origins that function in prokaryotic cells, the 
yeast 2 micron origin and the like. Also present on this non-donor domain of certain 
prefeired embodiments is a selective marker gene that provides for negative selection 
of the non-donor domain under particular conditions, e.g., negative selection 
conditions. This marker is folly fimctional and therefor is made up of a coding 
sequence operably linked to an appropriate promoter, i.e.. is provided by a functional 
expression module or cassette. Markers of interest that are capable of providing for 
this negative selection include, but are not limited to: SacB. providing sensitivity to 

suCTOse; ccdB; and the like. 

This non.donor domain of the donor vector may further include one or more 
additional components or elements that impart additional fonctionality to the donor 
vector. For example, the donor vector may be a vector that is specifically designed for 
use in conjunction with a yeast two hybrid assay protocol, e.g., such that one can 
detennine whether the gene of interest present in the donor domain encodes a product 
that binds to a second protein prior to transferal of the gene of interest to an expression 
vector. In such embodiments, the non-donor domain typically includes the following 
. additional elements: yeast origins of replication, e.g.. the yeast 2 micron origin; yeast 
selection markers, e.g.. URA3, Leu, and trp selection markers; and peptide fragments 
of yeast transcription factors that are expressed as translational fosions to tiie gene 
encoded within the donor-domain; where yeast two hybrid systems are known to those 
of skill in the art and described in: Fields. S. and 0-K. Song. 1989. A novel genetic 
system to detect protein-protein interactions. Nature 340:245-246; Fields. S. and R. 
Stemglanz. 1994. The two-hybrid system: an assay for protein-protein interactions. 
Trends Genet. 10: 286-292 and the MATCHMAKER system IE user manual, 
available from CLONTECH. In other embodiments, the non-donor domain main 
contain yet other fonctional elements that provide specific fonctions to the donor. For 
example. Donor vectors can be designed that would also function as prokaryotic 
expression vectors that express the gene of interest encoded on the donor domain in 
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prokaryotic cells either as a native protein or fused to an affinity or epitope tag. Such 
vectors may include the following elements in their non-donor domain: inducible 
bacterial promoters, such as the lac promoter or the P„ert)-i promoter; affinity or 
epitope tags, e.g., GST, 6x(HN), myc-tag, HA-Tag, GFP and its derivatives. Donor 
vectors designed to function as retroviral vectors would additionally include retroviral 
LTRs and packaging signals in the non-donor domain. Donor vectors for expression in 
mammalian cells might also encode affinity or epitope tags, e.g., GST, 6x(HN), myc- 
tag, HA-Tag, GFP and its derivatives; and mammalian constitive or inducible 
promoters, e.g., the CMV promoter, the tet-inducible promoter, the TK promoter; viral 
promoters, e.g., T7, T3, SP6. In a preferred embodiment of this particular embodiment 
of the subject invention, the donor vector is as follows. The donor-partial selectable 
marker comprises the open reading frame (ORF) for a selectable marker gene, and is 
placed between the two donor sequence-specific recombinase target sites, adjacent to 
the second-donor sequence-specific recombinase target site. In a more preferred 
embodiment of the donor construct, the open reading frame of the selectable marker is 
situated such that its 5' to 3' orientation is opposite that of the two donor sequence- 
specific recombinase target sites. 

In another embodiment of the donor construct, the donor construct is a closed 
circle {e.g., a plasmid or cosmid) comprising, in addition to the two donor sequence- 
specific recombinase target sites, the unique restriction site or polylinker and the 
selectable marker gene open reading frame, at least one origin of replication, and at 
least one donor-functional selectable marker gene. The methods of the present 
invention should not be limited by the origin of replication selected. For example, 
origins such as those found in the pUC series of plasmid vectors or of the pBR322 
plasmid may be used, as well as others known in the art. Those skilled in the art know 
that the choice of origin depends on the application for which the donor construct is 
intended and/or the host strain in which the construct is to be propagated. 

A variety of selectable marker genes may be utiUzed, either for the donor- 
partial selectable maikcr or for the donor-functional selectable marker, and such genes 
may confer either positive- or negative-resistance phenotypes; however, the donor- 
partial and the donor-fimctional selectable marker genes should be different from one 
another. In a preferred embodiment, the selectable markers are selected from the 
group consisting of the chloramphenicol resistance gene, the ampicillin resistance 
gene, the tetracycline resistance gene, the kanamycin resistance gene, the streptomycin 
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resistance gene and the sacB gene from B. subtilis encoding sucrase and confemng 
sucrose sensitivity. In a more preferred embodiment, the donor-partial selectable 
marker is a portion of the gene {e.g., the open reading frame) for chloramphenicol 
resistance and the donor-fianctional selectable marker gene is the gene for ampicillin 
resistance . In another preferred embodiment of the donor construct, the origin of 
replication and the donor-fimctional selectable marker gene lie 5' of the first-donor 
sequence-specific recombinase target site. 

In another embodiment of the present invention, there is provided a donor 
construct with all the above-described features, but additionally having a marker gene 
different from either the donor-fimctional selectable marker gene or the donor-partial 
selectable marker gene, wherein the additional marker gene is positioned 5' of the first 
sequence-specific recombinase target site such that upon combination with a 
recombinase, the additional marker gene is located on the undesired second donor 
fragment. This marker gene provides an additional screen to exclude any products that 
result in recombinants containing the second donor fragment. The marker gene could 
be, for example, LacZ. In this case, inconect recombinants would generate blue 
colonies on X-Gal plates. Alternatively, a more preferred additional marker would be 
the sacB gene conferring sucrose sensitivity, hi this case, any incorrect clones would 
be killed when grown on sucrose containing medium. The additional marker provides 
another screen, thereby enhancing the system by fiirther ensuring that only correct 
recombination products are obtained following recombination and transformation. 

In yet another embodiment of the donor construct, the donor construct fiirther 
comprises a termination sequence placed 3' of the restriction site or polylinker 
sequence but 5» of the second-donor sequence-specific recombinase target site. In a 
most preferred embodiment, the termination sequence is placed 5' of the 3' end of the 
donor-partial selectable marker {e.g. the ORF of the selectable marker gene in the 
preferred embodiment which is in the 5' to 3' orientation opposite that of both donor 
sequence specific recombiaase target sites). The present embodiment is not be limited 
by the teraiination sequence chosen. In one embodiment, the termination sequence is 
the Tl termination sequence; however, a variety of termination sequences are known 
to the art and may be employed in the nucleic acid constructs of the present invention, 
including the T6S. TINT. TLl, TL2. TRl, and TR2 termination signals derived from 
the bacteriophage lambda, and termination signals derived from bacterial genes such 
as the trp gene off. coli. 
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m another preferred embodiment of the donor construct, the donor construct 
fa ther comprises a polyadenylation sequence placed 3' of the unique restriction site(s) 
or polylinker but 5' of the second-donor sequence-specific recombinase target site. In 
a most preferred embodiment, the polyadenylation sequence is placed 5' of the 3' end 
of the open reading frame of the selectable marker gene similar to the placement 
described for the tennination sequence supra. The present invention should not be 
limited by the nature of the polyadenylation sequence chosen. In one embodmient. the 
polyadenylation sequence is selected from the group consisting of the bovine growth 
hormone polyadenylation sequence, the simian virus 40 polyadenylation sequence and 
the Herpes simplex virus thymidine kinase polyadenylation sequence. 

Also, in a preferred embodiment, the donor construct further comprises a gene 
or DNA sequence of interest inserted into the unique restriction enzyme site or 
polylinker The present invention should not be limited by the size of the DNA of 
interest inserted into the unique restriction site or polylinker nor the source of DNA 
15 {eg, genomic libraries, cDNA libraries, etc.). 

' Thus, in a most preferred embodiment of the donor nucleic acid construct, 
there is provided, in 5' to 3' order: a) a first-donor sequence-specific recombinase 
target site; b) a nucleic acid or gene of interest; c) termination and polyadenylation 
sequences; d) an open reading fi^e for a selectable marker gene in a 5' to 3' 
20 orientation opposite to that of the first-donor sequence-specific recombinase target 
site- e) a second-donor sequence-specific recombinase target site in the same 5' to 3 
orientation as the first donor sequence-specific recombinase target site, wherem the 
second-donor sequence-specific recombinase target site is able to recombine with said 
first-donor sequence-specific recombinase target site; f) an origin of replication; and g) 
25 a donor-fimctional selectable marker gene. 

As mentioned above, in an alternative embodiment of the subject invention, the 
donor vector employed in the subject methods includes only a single recombinase 
recognition site, while the acceptor vector, described in greater detail below, includes 
two recombinase recognition sites. In this embodiment, the donor vector includes: a) a 
30 donor partial selectable marker element; b) one sequence-specific recombinase target 
site with a defined 5' to 3' orientation; and c) a unique restriction enzyme site or 
polylinker, said restriction enzyme site or polylinker being located 3' of the sequence- 
specific recombinase target site. The donor partial selectable maricer element must be 

placed in said donor construct so that when the donor construct later recombines with 
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the acceptor construct, a functional selectable marker is fonned in the resulting final 
recombination product. In a preferred embodiment of this alternative embodiment, the 
donor partial selectable marker element comprises the open reading firame (ORF) for a 
selectable marker gene placed adjacem to the sequence-specific recombination site 
such that its 5' to 3' orientation is opposite to that of the sequence-specific 
recombination site, hi addition, in this preferred embodiment of the alternative 
embodiment of the donor construct, the donor construct is a closed circle {e.g.. a 
plasmid or cosmid) comprising, in addition to said sequence-specific recombinase 
target site, said unique restriction site or polyHnker and said selectable marker gene 
open reading frame, an origin of replication capable of rephcating the final 
recombination construct, a functional selectable marker gene driven by a promoter, a 
prokaryotic termination sequence placed 3 ' of the restriction site or polylinker 
sequence and a eukaryotic polyadenylation sequence placed 3' of the restriction site or 
polylinker. Also, in a preferred embodiment of the alternative embodiment, the donor 
constract further comprises a gene or DNA sequence of interest inserted into the 
unique restriction enzyme site or polyHnker. The present invention should not be 
limited by the size of the DNA of interest inserted into the unique restriction site or 
polylinker. 

Figures 2A to 2C provide schematic representations of three different 
representative specific donor vectors, specifically donor plasmids, of the subject 
invention, i.e.. pDNRl; pDNR2 and pDNRB. Each of these specific representative 
vectors includes two loxP sites oriented in the same direction. Also present in each of 
these specific donor plasmids is the chloramphenicol resistance open reading frame 
and a multiple cloning site, which elements are flanked by the lox P sites and are 
present on the part of the plasmid that is incorporated into the final expression vector 
upon practice of the subject methods. Also present on each of the donor plasmids are 
two selectable markers on the portion of the plasmid that is not incorporated into the 
final expression vector, i.e.. Amp' and SacB. These three specific donor plasmids 
differ from each other with respect to the multiple cloning site, and specificaUy the 
open reading frame of the multiple cloning site, as shown in Fig. IB. Yet another 
specific donor vector of interest is the p-DNR-Lib vector, shown in Fig. 2D. In this 
vector only one selectable marker is present on the portion of the plasmid that is not 
incoiporated into the final expression vector, e.g., SacB. 
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The Acceptor Vector 
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AS menlioned above, in a preferred embodin^ent of the subject invention, the 
acceptor vector employed in the subjectmethods isavector that includesasin^^^^ 
^ombinase site, in these en,bod,.entMhesinglerecombinase site is flanked onone 

side by a promoter and on the other side, m certain preferred embodiments, by a 
portion ofaselectablemarker,e.g.,apromoteroracodmg sequence, where inmany 

preferred embodiments described forther below, this portion or sub-part of the 
selectablemarkerisasecondpromoter,e.8.,abacterialpromoter. Inthese 
embodiments, the sbglerecombinase site is flankedbytwooppositelyonented 
promoters, where oneofpromoters drives expressionofthegeneofinterestmthe 
expressionvectorproducedbythesubjectmethods and the second promoter dnves 
expressionofthe coding sequenceoftherecombinant-f^nctionalselectablem.^^^^^^ 
the expression vectorproducedbythe subject methods, h. these embodiments, the first 
promoter isapromoter that is capable of driving expressionofthegeneofinterestm 
the expression vector, where representative promotersinclude. but arenot limited to 
theCMVpromoter.thetet-induciblepromotenretroviralLTRpromoter/enhanc. 

sequences, the TK promoter, bacterial promoters, e.g. the lac promoter . the Pueto-. 
promoter; the yeast ADH promoter and the like. The distance between thefirst 

promoter and the recombinase site is one that allows for expression in the final 
expression vector, where the distance typicallyranges from aboutlbptolOOO bp. 

usuaUy from aboutlObp to 500bp.Tl>esecondpromoter isapromoter that.capab^ 

ofdrivingexpressionoftherecombinant-functionalselectabIemarker.andis 

abacterialpromoter.Bacterialpromotersofinterestinclude.but are not limnedto:th^ 

A„,picillinpromoter.thelacpromoter.thePu^.promoter.theT7promoterandthe 

like TTiedistancebetweenthcbacterialpromotcr and therecombinase site is suffiaent 

to pn,vide for expression of the selectable marker in the expression vector and 
typically ranges from about 1 bp to 2.5 kb, usually from about 1 bp to 200 bp. 

AS indicated above, in yet other preferred embodiments the acceptor vector 

, lacks the portion or subpart of the selectable marker. In these embodiments, the 
acceptorvectormay be used withadonorvector that includesacompletepo.Uve 

selectablemarker in the desired donor fragment flankedbythetworecombinase sues, 
ie thedonorvectorportionlocatedbetweentheS'endofthefirstrecombinasesUe 
and the 5' end of the second recombinase site. Alternatively, the acceptor vector may 
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be used with a donor vector that only includes a partial selectable positive marker, as 
described above, where the partial marker is nonetheless factional in the resultant 
expression vector. 

The acceptor vector of the embodiments described above may include a 
number of additional components or elements which are requisite or desired depending 
on the nature of the expression vector to be produced from the acceptor vector. In 
many embodiments of the subject invention, the acceptor vector is an acceptor nucleic 
acid construct comprising: a) an origin of replication capable of replicating the final 
desired recombination construct or expression vector; b) an acceptor sequence-specific 
recombinase target site having a defined 5' to 3' orientation; c) a first promoter 
adjacent to the 5' end of the acceptor sequence-specific recombinase target site; and d) 
an acceptor-partial selectable marker, wherein the acceptor-partial selectable marker is 
capable of recombining with a donor-partial selectable marker from a donor construct 
(or first donor fragment, once the donor construct is resolved) so creating a 
recombinant-fimctional selectable marker in a final desired recombination construct. 
As in the donor construct, the acceptor construct is not limited by the nature of the 
sequence-specific recombinase target site employed, and in preferred embodiments the 
sequence-specific recombinase target site may be selected from the group consisting of 
loxP, loxP2, loxPSl 1, 10XP514, loxB, loxC2. loxL, loxR. loxA86, loxA117, loxP3, 
loxP23, att, dif. and frt. The acceptor sequence-specific recombinase target site from 
the accqjtor construct does not have to be identical to those on the donor construct; 
however, the sequence-specific recombinase target sites on the acceptor and donor 
constrocts must be able to recombine with each other. 

In a preferred embodiment, the acceptor-partial selectable marker is a second 
promoter, wherein the second promoter is oriented such that its 5' to 3' orientation is 
opposite that of the acceptor sequence-specific recombinase target site and the first 
promoter, and wherein the 3' end of the second promoter is adjacent to the 3' end of the 
acceptor sequence-specific recombinase target site. 

The acceptor construct is not limited by the nature of the origin of replication 
employed. A variety of origins of rephcation are known in the art and may be 
employed on the acceptor nucleic acid constructs of the present invention. Those 
skilled in the art know that the choice of origin depends on the appHcation for which 
the acceptor construct is intended and/or the host strain in which the construct is to be 
propagated. In the case of the acceptor construct, the origin of replication is chosen 
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appropriately such that both the acceptor construct and the final desired recombination 
constmct will be able to replicate in the given host cell. 

The acceptor construct also is not limited by the namre of the promoters 
employed. Those skilled in the art know that the choice of the promoter depends upon 
the type of host cell to be employed for expressing a gene(s) under the transcriptional 
control of the chosen promoter. A w,de variety of promoters functional m viruses, 
prokaryotic cells and eukaiyotic cells are known in the art and may be employed m the 
acceptor nucleic acid constructs of the present invention. In a preferred embodmient 
of the invention, the donor construct contains a gene or DNA sequences of interest and 
0 when the donor construct recombines with the acceptor construct, the first promoter of 
the acceptor construct is positioned such that it will drive expression of the gene or 
DNA sequences of interest. Thus, a promoter capable of driving the gene or DNA 
sequences of interest should be chosen for the first promoter. Further, in a preferred 
embodiment of the present invention, the acceptor-partial selectable marker is a 
i5 promotercapableofdrivingtheexpressionofthedonor-partialselectablemarkerORF 

from the donor construct (e.g., the promoter for the ampicillin gene from the plasmxd 
pUC19) or a viral promoter including, but not limited to. the T7, T3. and Sp6 
promoters. 

In yet another preferred embodiment of the acceptor construct, the acceptor 
20 construct additionally includes a DNA sequence encoding a peptide affinity domain or 
peptide tag sequence, wherein the affinity domain or tag sequence is 3' of the first 
promoter and 5' of the acceptor sequence-specific recombinase target site, such that 
the expression of the affinity domain or tag sequence is under control of the first 
promoter, and such that it is in the same translational frame as the acceptor sequence- 

25 specificrecombinasetargetsite. The present invention is not limited by the nature of 
the affinity domain or tag sequence employed; a variety of suitable affinity domams 
are known in the art. including glutathione-S-transferase, the maltose binding protem. 
protein A. protein L, polyhistidine tracts, etc.; and tag sequences include, but are not 
limited to the cMyc Tag. the HA Tag, the FLAG tag. Green Fluorescent Protem 

30 (GFP), etc. 

In another preferred embodiment of the acceptor construct, the acceptor 
construct further includes an acceptor-functional selectable marker. Hie present 
invention is not limited by the namre of the acceptor-fimctional selectable marker 
chosen and the selectablemarkergenemayresult in positive or negative selection. In 
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a preferred embodiment, the acceptor-functional selectable marker gene is selected 
from the group consisting of the chloramphenicol resistance gene, the ampicilhn 
resistance gene, the tetracycline resistance gene, the kanamycin resistance gene, the 
streptomycin resistance gene and the sacB gene. 
5 m addition to one or more of the above described components, the acceptor 

vectors may include a number of additional components that impart specific fimction 
to the expression vectors that are produced from the acceptor vector according to the 
subject methods. Additional elements that may be present on the subject acceptor 
vectors include, but are not limited to: (a) elements requisite for generating vectors 
10 suitable for use in yeast two hybrid expression assays, e.g.. a GAL4 activation domam 
coding sequence, a GAL4 DNA-blnding domain coding sequence, (as found in pLP- 
GADT7 and pLP-GBKT7 shown in Figs. 3 A & 3B); (b) elements necessary for study 
of the localization of a protein in a cell, e.g.. tagging elements such as fluorescent 
protein coding sequences, such as the GFP coding sequences (as found in pLP-EGFP- 
15 CI. pLP-ECFP-Cl and pLP-EYFP-Cl shown in Figs. 3C to 3E); (c) elements 

ne^sary for constitutive, bicistronic expression in mammalian cells, e.g.. IRES sites, 
in combination with selectable markers, e.g. antibiotic resistance, fluorescent protein, 
etc (as found in pLP-IRESneo and pLP-IRES2-EGFP shown in Figs. 3F to 3G); (d) 
elements necessary for inducible expression of the gene of interest on an expression 
20 vector. e.g. inducible promoters such as the tet-responsive promoter, etc. (as illustrated 
by PLP-TRE2, pLP-ProTet and pLP-RevTRE. shown in Figs. 3H. 31 and 3K); (e) 
elements that provide for retroviral expression vectors, e.g.. as found in pLP-LNCX 
and pLP-RevTre shown in Figs. 31 and 3 J; and the like. 

Also provided is an alternative acceptor construct embodiment that can be used 
25 with the alternative donor vector described above. In this embodiment, the alternative 
acceptor construct includes: a) an origin of replication; b) a first sequence-specific 
tecombinase target site and a second sequence-specific recombinase target site each 
having a 5' and a 3' orientation, wherein said first and second sequence-specific 
recombinase target sites have the same 5' to 3' orientation and where said first and 
30 second sequence-specific recombinase target sites can recombine with each other and 
with the sequence-specific recombinase target site of said alternative donor construct; 
c) a first promoter element having the same 5' to 3 ' orientation as the sequence- 
specific recombinase target sites and wherein said fu^t promoter element is positioned 
at the 5' end of said second sequence-specific recombination target site; and d) an 
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acceptor partial sel»,ab,= mato elcMcn. whcdn said acceptor partial selcc«*le 
„a*e,=lcn,c„.isc.pableofr— gwiOrsai^onorpartial^lcc*^^ 
c,cm«,. from said a,.en,a.ive donor cons™u«crea,cail««o.alsclec«blen»H.. 

elcm«..inthcfinalrecombina,ionco„s.n.c,. toaprcfe™lcmbodimcn.of.l«s 
a,,en,a,iv=embodi™«,.,said acceptor partial selcc,ablcm-er=lcn,c.,isas«o.^^ 

promoter having a 5' and 3' end, wherein said second promoter is oriented such th« 
its yt. 3' orientation is opposite to that of said acceptor se,nence.sp«.fic 
recombination sites and said first promoter etaent. and wherein the 3' end of sotd 

second pn>mo..r is «.jace„t to the 3' end of the ftrs. se,.e„ce-spedfic r«ombm..K». 
siu Also in a preferred embodhnen. of Ure alternative embodimen. of the acceptor 
^C, the acceptor construct additionallycomprisesaDNA science encodmga 

pep«deafflni.ydomainorpeptid.taBse,.e..e.whereinsaidaffini.,do,^.s«.>^^er 
control ofthe said firs, promoter element »Kl is in the same transMonalfiam. as the 

aeco^l se^uence-speciflc recombinase site. Also, aprefe.ed embodimen. of the 
.llcm^ive embodiment of the acceptor construct Ibrther comphses a tacfonal 
selectable marker gene. 

Expression Vector Generation with a Recombinase 

AS mentioned above, in the subject methods the donor and acceptor vectors are 

contacted with arecombinase under conditions sufficient for site specific 
combination tooccur.specifically under conditions sufficient forarecombmase 

n^ediatedrecombinationeventto occur that produces the desired expression vector. 

where expression vector production is accomplished without cutting or ligauonofme 

donor and acceptorvectorswithrestrictionendonucleases^d nucleic acid hgases. 

TT.e contact may occur under in vitro or in vivo conditions, as is desired and/or 

convenient. . . 

to many embodiments, an aqueous reaction mixmre is pr»luced by combmmg 
are donor and acceptor vectors and the recombinase with water and other re<,u,s«c 
, and/or desired components to produce a reaction mixmre that, tntder appropnate 
conditiom, results in producdon of dte desired expression vector. The vartous 
,^umaybecombi.edsepar..elyorsimnl.aneously.dependingon.h..»»^ 

of the particular comp«.em and how the components are combined. Convemen%, the 
,^,sof«.e,eact,onmixn«a..c«nbinedin.smtablecontainer.Theamount 
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ofaonc ^ acceptor veco. fta, are present m .he reason n.ix»c are s«fficie« >o 
p;raefor«,.dlap,oaucUo„or,.ee,pres.onvee,orpro.»c..wher.O,e»o«nU 

„f.o,»r.ndaecep«.rve«orn«,be*e.a,neordifferen,,bu,are,„.a„y 

of donor andaecepror vector .ha. i^prese.. in mereac„o„™.x«n.ra.^^ 
r.a.ou.50...o.».«sna,...oa..><»n.,oS00.a...ore.^^^^^^ 
al„u.l50n8.o300ntforar^o.vo.«.ran^.6fto».abou.5^Uo.OOO^, 

usuaUy from about 10 ^l to 50 jil. 

n.r.co«bnase.ha.isprese„.in.h.r..c.o„mixnne.one«,a,prov,des(or 

^teataoffl.edonora.d.«ep.orvec.ors.U.oned.u=cogoizes.he 

^„„bina.e,«o^.ionsi.cso„.h.donora„daccep«vec.o..Assoc^*^^ 

^«na.eo,.o^»fl.vary.»he,c,ep.es««a«ve,eco™.~^^^^^ 
^,Wed.o:.ecc,n«„as».™.sposesa„dio»srasea.«bcespec,ficr— es 

„ri..erea.U.c.*b«a,e.o.,«-.o=Crereoo™blnaseC.hecre,e„c.a^bee„ 

c,o„ed».dexpre.edi„ava,ie.yo,tosU,andU«eoz,n,eca.bepur,8^^^^^ 

^o«gen«,yusinga.^d.rd,.ohni,u.sb»™m*.a«..p.^«edCrep,«.e^^ 

leo^.he«she..n.reco»*inaseotb.c,enoph.geLambda«^reco^zes*e 

^^*.difsi,.«>eWpro.*.^»*eW,6»ansposo„;*eTn3re»lv^^. 
„»rreco,*™se;,hcCtareco,.«na«;*ei,nn,»»g.oW»reco*„.3.,«^^ 
like WMle««amountofr=combtaa«prese.lin.hereactio„mix»rema,var, 

td..o.U,ep,r.c.arrec«*asee„..Min-.-^^^ 
Jges»omabo».0.,™u.o.250uni.s.o-iyft™abo«.un...olOorn.and 

S LoLusuaUy^o^aboo.. -u.2«»i.a.f«*e..ovedesc,i..d,e.*onvo,-«^ 
,„,e„.,eae.o.n,.»e^i.eMdead.«o»>oo.,.^^^ 

«,te thereof e R buffering compounds, such as Tns-HCl, Mbi>, 
buffer or components tnereoi, e.g., DU""*"© r 

l.».phosp.a,e«er,aodi»ac«a.e.ui^..M..»«..jW*are^ 

,0 »^.;n,o„ova,en.,o„s.e.g.,sodiu.,ch.o,id^""^«»'»=-**-'"" 

.^onnuran^ngfro^abonno^ "'""^'^^'T- 70^11^^ 
^.diva,e„ua.io„.,e.g,mag..s™.calch»«.d*«'*«.**-°»™'7'' 
r^o»nura.«»o.abou.. n^.aO^M.— yfto...o«5n^ » V^n*.. 

^d other co,npone„u,e.^.BSA.EDTA.sp«™idtac...^«»»>=-.-<»-"" 
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above an,o»„.ra„gesa,ep™vid«ifor*er^U.ive,««io„ volumes d^mbcd 
above), AS the reacUo. mix^res are aqueous reaodon nicmres. U«y also mcWe 

Tte subjec, reaction nJx«rres are twWly p.q«««i « "-P«^ ™eing 
Jom about 0-4'C, e.g., on ice, to mirAni- ^"""""^ 
fixture preparation, the ten^pcrature of the reacticn n.ix»e is .,^ieall, rarsed to a 
temperature that provides for optituun, or tnaxin,.. reoontbit^s. acivi^-an- 

eo.Lita„t,.expressio„vec.orpro*..o„.0«e..in,hispo,1ion^^ 
,e„.pera,„rewinbe,aisedtoa,e„,pera«u.rangi.g<rom.bou.4 C.0 37 

ftom about 1 0 "C to 25 "C , where the ntixUrre »in be u^totained « .Us 
foraperiodoftitrresufftcient for thedesiiedamou^of expression vectorproducoon 

.„ocL,e.g.,foraperiodoftimera.giugfton,abou.5n,i,«.o60mi.s.„s„any»om 
about iO mins to 15 mim. Following the iuc»ba«oup.=riod, the reacdon mtxture . 
aubjeeted to conditions .uffloien. to inactivate dre 

o„hereactionmixmren>.yberaisedto.valuerang,ngfe»nabou.65 C«.70 Cfbr 
aperiodoftimeranginglromabout 5 mins to 10 mins. 

Altentatively, contact of the donor and a«^.orv«»».swi*«»t.cot,>bo»s. 
™y occur in vivo. whe. the donor and a«^.or vec«,rs arc inhx-iucd in a smttW. 
host cell that expressesarccombinase. in .Ms embodin,en.>herecombin.do.bem^ 

He donor a«l a«.cp.or vectors may be accomplished in vi«, using a host ceU th^ 
,„^ently or constitutively expresses the appropriate site-speciSc recombtnase (.f . 
Crerecombinase expressed in thebacterial strain BNNU2. available t»m 
CLOKTECH). pDonor and pAcceptor, i.e., the donor and acceptor vectors 
.especdvely, are co-transfomted into the host cell usirrg a variety of metoidstawn m 
the art(e«.transforma.ionof cells made competent by .reatmen. with CaCh. 

elec.«^radon.e.c.). The er^transformedho^ cells are grown under co»h«o»t 
whichsd.c.for.h.pres««of,herecombi™n..fimctionalselec.abl.m,*ercr»^ 

byrecombi-tfon of pPouor with the pAcceptor (e.». growth irr the p«s«.« of 
chlorampt^colwharthepDonorve^or contains all or partoftheehloramphcnrcol 

, .esisu^geneopenreadingframeandpAcceptormayalsocoutainap^ter 
^ar, for expression of the chloramphenicol op«. ftame). Plasmtd DNA rs 
isoUted Som host cells which grow in the presence of the seleoHv. pressure arKl ts 
anbjeetedtoresoicdon enzyme digestitm to confinn that the desi^recombmado. 

event has occurred. 
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The present invention also provides a method for the in vitro recombination of 
nucleic acid constructs, comprising the steps of: a) providing i) a donor nucleic acid 
construct comprising a donor-partial selectable marker, two donor sequence-specific 
recombinase target sites each having a defined 5' to 3' orientation and wherein the 
donor sequence-specific recombinase target sites are placed in the donor construct 
such that they have the same 5' to 3' orientation, and a unique restriction enzyme site 
or polylinker, the restriction enzyme site or polylmker being located 3' of the first- 
donor sequence-specific recombinase target site and 5' of the second-donor sequence- 
specific recombinase target site; (ii) an acceptor nucleic acid construct comprising an 
origin of replication, an acceptor sequence-specific recombinase target site having a 
defined 5' to 3' orientation, a first promoter adjacent to the 5' end of the acceptor 
sequence-specific recombinase target site, and an acceptor-partial selectable marker, 
wherein the acceptor-partial selectable marker is capable of recombining with the 
donor-partial selectable marker fi-om the donor construct to create a recombinant- 
fimctional selectable marker in a final desired recombination construct; b) contacting 
the donor and acceptor constructs in vitro with a site-specific recombinase under 
conditions such that the desired donor firagment recombines with the acceptor 
construct to form a final desired recombination construct. 

The present invention fiirther provides a method for the recombination of 
nucleic acid constructs in a host, comprising the steps of: a) providing i) a donor 
nucleic acid construct comprising a donor-partial selectable marker, two donor 
sequence-specific recombinase target sites each having a defined 5' to 3' orientation 
and wherein the donor sequence-specific recombinase target sites are placed in the 
donor construct such that they have the same 5' to 3' orientation, and a unique 
restriction enzyme site or polylinker, the restriction enzyme site or polylinker located 
3' of the first-donor sequence-specific recombinase target site and 5' of the second- 
donor sequence-specific recombinase target site; (ii) an acceptor nucleic acid construct 
comprising an origin of replication, an acceptor sequence-specific recombinase target 
site having a defined 5' to 3' orientation, a first promoter adjacent to the 5' end of the 
acceptor sequence-specific recombinase target site, and an acceptor-partial selectable 
marker, wherein the acceptor-partial selectable marker is capable of recombining with 
the donor-partial selectable marker fi-om the donor to create a recombinant-fimctional 
selectable marker in a final desired recombination construct; and iii) a host cell 
expressing a site-specific recombinase; b) introducmg the donor and acceptor 



29 



PCT/USOO/19221 

WO 01/05961 

consm.* tau> .he host cell under co»dMcm ^ *e desired d».»r ftagmen. 
^Mbines with the accep«,r eonsUue. «. *e Brud desired recombinaUm 
oonsmic. »hich is capable of taparling d« «- " 

growth medium. 

5 above meftods of producing expression vectors can be employed to 

^idly produce a plurality of different expression vectors that are disdnet ion, each 
other but cany the sante coding sequence of interest &om a single. orignnU type of 
aonor vector. In other words, the subject methods can be used to rapidly clone a 
nucleic acid of interest fton, an initial vector into a plurality of expression vectors. By 

,0 p,«,.lityismeanta.lcas,2,usuallyatleast5..dn»reusuany..l.as.lO,»he«.be 

n«b.. may be as high as 20, 96 or more. The m«hods can be performed by one 
person in a pedod of time fl,at is a taction of «ha. it would take by .ha. pen»n of slall 
in Ote ar. .0 produce .he same number and variety of expression vectors usmg 
.radiUonal cubing and ligation pt^tocols, where 0,. increase in efSciency obtain«l by 
,5 subject me*ods is a. leas. abou. 6 fold, usually at leas, about 15 fold and more 
usually a. leas. abou. 30 ibid. 

The Resukml Expressim Vector 

,0 Tie above steps result in the production of an expression ve«or ftom donor 

and acc^tor vectors, and more specifically fromaportfonofoneofth.se vectors and 

tbe e^tfrety of dte oto of vec.ors, e.g.. &om a porUon of donor ve«or and 
.he entirety of *. acceptor vector, where by portion is mean. 4e par, of Ure donor 
,«»or to, Ues 3- of d>e firs, donor sequorce-specifc recombinase si.e and 5" of dr. 

ma, vary.dependingontenamreofthe vector. Where Ore vec»risaplasm.d.fte 
sizeofflre^cpre^ionvecrmayrange iron. aboutJkb to 20 kb. usually from .b».4 

1*«>8M). . . ^ ^ 

The «^ expression vector is charaCeHzed in *a. ,. tnclodes Mo 
30 recombinase r«»gnition si,es, i.e., a frs. and second recombinase recogniiton s,tt, 
ori.n,«i in *e same direction. The disunce bemeen ,he tirst and second recombn».. 
sites, .p«»fically dis«nce be«-een tire 3' end of the first recombinase si« and *e 
y end of *e second r«»mbinase si,e, ranges in many embodiments from abon. 100 
bp » .00 kb. usually tan abou, 500 bp to 20 kb, depending on wheUrer tire codmg 
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sequence of a protein of interest or just a restriction site/multiple cloning site, is 
present between the first and second recombinase recognition sites. The portion of the 
vector that lies in this inter recombinase region, i.e. 3 ' of the first recombinase site and 
5' of the second recombinase site, typically makes up from about 2 % to 85%, usually 
5 from about 20% to 60 % of the entire expression vector. 

In many embodiments, the expression vector is further characterized in that 5' 
of the first recombinase site is a first promoter. 3' of the first recombinase site is at 
least one restriction site; and the second recombinase site located inside a functional 
selectable marker, i.e., it is flanked by disparate portions or sub-parts of a selectable 
10 marker expression module or cassette (e.g., a promoter and a coding sequence), where' 
the second recombinase site is present between the two sub-parts of the selectable 
marker in a manner such that the selectable marker is fiinctional, i.e.. the coding 
sequence of the selectable marker is expressed. In other words the expression vector 
includes a selectable marker expression cassette or module made up of a promoter and 
15 coding sequence that flank the second recombinase site. In many embodiments, the 
second recombinase site is flanked by a promoter on its 3' end and a coding sequence 
of the selectable marker on its 5' end. In this embodiment, the first and second 
promoters, located 5' of the first recombinase site and 3' of the second recombinase 
site, respectively, are oriented in opposite directions. 
20 The expression vector is further characterized by having at least one restriction 

site, and generally a multiple cloning site, located between the first and second 
recombinase sites. In many embodiments, located between the first and second 
recombinase sites, and flanked by two restriction sites, which may or may not be the 
same, is anucleic acid of interest, i.e., gene of interest, that includes a coding sequence 
25 for a protein of interest whose expression from the expression vector is desired. In 
these embodiments, the first promoter 5' of the first recombinase site and the coding 
sequence for the protein of interest are arranged on either side of the first recombinase 
site such that they form an expression module or cassette that expresses the encoded 
protein, i.e., the coding sequence and first promoter flank the first recombinase site in 
30 manner such that they are operably linked. 

In addition to the above features, the expression vector fiirther includes at least 
one origin of replication that provides for replication in the host or hosts into which it 
is placed or transformed during use. Origins of replication of interest include, but are 
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„o,.in,i.edto>o» described above in connect «i**e donor and .oce^o, 

vectors. 

b, a specific embodi«nl. .he expression vector or final constmet IS 
ebaracterix^i as follows-Ms final desir^l recombinadon co,«B«c, comprises, rn 
™«bley,o3-order:a)afirs.pron,o.er;b).fi,s.-recomb,na„.se<,.e„ce.spec,fie 

rLmbinase .arge. si.e, wbere,„ ,he 5' end of 0,e firs-reconrbinan, se^ence-spe^* 
rcconrbinas. .arge. si,e is derived fion, U,e 5' end of .he accep.or se,uenc.spec.fic 
,eco,„bi«as..arge.si»f^n,U,eaccep.or.ndU,ere„dof.hefirs.-r.co»b™^^ 

«,«„c.speciflc reoombinase .arge, sire is derived fion, *e e«l of firs.^onor 

„ «,„e.ce.specific,«»n.binase urge. si.e of «,e donor cons.n,c,; c) a nnK^e 

rcsrtcfion enzyme si»= or polylinker; d) U,e donor-partiai sclec^ble mad.^ 0 a 
soco„d-recombinan,se<,nence-specificrecon.bn.ase.arge.si.e,oca.rfw.Onn,he 

^binan.-fimcdona.selecublemarkergene^dad,acen..od,edon.^a«^^^ 
so,ec.abte,na..erand«».ccep.or-pariialselec.b,.n,arlcer,whe.m*e5^^^ 
,5 second.r«ombinan.se,u..ce.speciflcr^mbi™seUrge.si.eisdenvedftomdK5 
^of«.seco.d^onorse,ncnc..specificrecon»binase.arge,s,.efi««.U,etoor 
c„.s«c.=»i.I»3'endofU,esecond-recombi„an.se,ue„ce-speei6erecombm^ 
«rg« site is derived ftom *. 3' end of .he accept, sequence-specific recomb,,^ 
.ar^e. si.e of d« accept, con*... 0 «.e .ccep.«r-p,r.ia, selecUble marker, whe^ 
Jaccep.or-pa,Ua.se..c»blemarke,adjoinsa,edo.or.pariials=.ec.abemancer» 

p^c...ew.,^edrecombi„an.-ta.crionalselecuble marker; and. g) an ongm 

cation* ^ * • « « 

to a preferred embodimen., final desired r^mbinaUon produc, conuuns a 
ge„corDNAse,„enceofin«res.i..ser.edin.od,enni,»er..ricfio„e„^^=^^^ 
L«er sueh«*ege.eorDNAs«,»enceoti„.eres.isunder .he con.ro of«.e 

Lpromo^r. I. »cba.«nbodim««..heg.eormAse,nenceof,n.eres . corned 
„j3-».dof4.ta.-r.oombina«.se,»ence.specificrecombinase.arge.s..esuch 
^,.fim«ional»nscripdo.alum.isfo™=dso.ha.*egeneorDNAse,nenceof 

to.er.s.is.xpr.ss.das.pro.eindrivenb,d«firs.p.mo.erof.heaccep^^^^ 
toamo,epreferredembodime«,0.egeneofin.e,es.is,o.nedtod.e3e„d ffcefi,^ 

^binan.se,u««e^«crecombinase«rge.si.esnch*a.afi.nc„ona^ 
^,ari„».readingta,eiscrea«d»berein*eg.eorDNAse,ue^ceonn^n^ 
«pressedasafi.sionpro,ein»ifta„alBm.ydomai,or..gse,«encedenvedfiom,he 
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10 



acceptor 



plasmid and under the expression control of the first promoter of the acceptor 

construct. - ^ * 

In another preferred embodiment, the fmal desired recombinaUon construct 

further comprises an acceptor-functional selectable marker gene derived from the 
acceptor construct. If an acceptor-functional selectable marker gene is present m 
addition to the newly-created recombinant-functional selectable marker, the acceptor- 
functional selectable marker ,s a different selectable marker from the newly-created 
recombinant-functional selectable marker. The present invention should not be Imuted 
by the nature of the selectable marker genes chosen; the marker genes may result m 
positive or negative selection and may be chosen from the group including, but not 
limited to, the chloramphenicol resistance gene, the ampicillin resistance gene, the 
tetracycline resistance gene, the kanamycin resistance gene, the streptomycm 
resistance gene, the strA gene and the sacB gene. 



15 Utility 
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The subject methods find use in a variety of different applications, where such 
applications are generally those protocols and methods in which the transfer of a 
nucleic acid of interest from one vector to another, e.g.. the cloning of a nucleic actd 
from an initial vector into a final vector, is desired. As such, the subject methods are 
particularly suited for use in cloning nucleic acids of interest, including whole 
libraries, from an initial vector into an expression vector, where the expression vector 
xnay be functionalized to express the polypeptide or protein encoded by the nucleic 
acid of interest located on it in a variety of different desired enviromnents and/or under 
desired conditions, e.g.. in a cell of interest, in response to a particular sthnulus. tagged 
by a detectable maricer, etc. 

As such, the expression vectors produced by the subject methods find use m a 
variety of different applications, including the study of polypeptide and protein 
function and behavior, i.e.. in the characterization of a po^reptide or protein, either 
known or unknown; and the like, hi the broadest sense, the subj ect methods find 
application in any method where traditional digestion and ligation protocols are 
employed to transfer or clone a nucleic acid from one vector to another, e.g.. clomng 
digestion and ligation protocols, where the expression vectors produced by the subject 
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methods find use in research appHcations, as well as other applications. e.g., protein 
production applications, therapeutic applications, and the like. 

Systems 

5 

Also provided are systems for use in practicing the subject methods. The 
subject systems at least include a donor vector and an acceptor vector as described 
above, hi addition, the subject systems may include a recorabinase which recognizes 
the recombinase sites present on the donor and acceptor vectors. The systems may also 
10 include, where desired, a host cell, e.g., in in vivo methods of expression vector 

production, as described above. Other components of the subject systems include, but 
are not limited to: reaction buffer, controls, etc. 

Libraries 

15 

Also provided are nucleic acid libraries cloned into donor and/or acceptor 
vectors of the subject invention. These nucleic acid libraries are made up of a plurality 
of individual donor/acceptor vectors where each distinct constituent member of the 
library has a different nucleic acid portion or component, e.g., genomic fiagment, 

20 cDNA, of an original whole nucleic acid library, i.e., fragmented genome, cDNA 

collection generated from the total or partial mRNA of an mRNA sample, etc. In other 
words, the libraries of the subject invention are nucleic acid libraries cloned into donor 
or acceptor vectors according to the subject invention, where the nucleic acid Ubtaries 
include, but are not limited to, genomic Hbraries, cDNA libraries, etc. Specific 

25 donor/acceptor libraries of interest include, but are not limited to: Human Brain Poly 
A+ RNA; Human Heart Poly A+ RNA; Human Kidney Poly A+ RNA; Human Uver 
Poly A+ RNA; Human Lung Poly A+ RNA; Human Pancreas Poly A+ RNA; Human 
Placenta Poly A+ RNA; Human Skeletal Muscle Poly A+ RNA; Hmnan Testis Poly 
A+ RNA; Human Prostate Poly A+ RNA and the like. With donor libraries according 

30 to the subject invention, the subject methods permit the rapid exchange of either 

individual clones of interest, groups of clones or potentially an entire cDNA library to 
a variety of expression vectors. The cDNA library is generated using a pDonor 
construct as the cloning vector (a pDonor Ubrary, e.g., pDNR-Lib as shown in Fig. 
2D). The entire library may then be transferred (using either an in vitro or an in vivo 
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recombination react.on) mto any expression vector modified to contair, an acceptor 
sequence-specific recombinase target site a lox site) (f.e.. an acceptor construct). 
This solves an existing problem in the art. in that there is no way. using existing vector 
systems, to exchange the inserts in a library made in one expression vector en masse 
{i.e.. as an entire library) to a different expression vector. 



Kits 

Also provided are kits for use in practicing the subject methods. The subject 
10 kits at least include at least one donor vector and a recombinase that recognizes the 
recombinase sites of the donor vector. The subject kits may further include other 
components that find use in the subject methods, e.g.. acceptor vectors; reaction 
buffers, positive controls, negative controls, etc. 

In addition to the above components, the subject kits will fiirther include 
15 instructions for practicing the subject methods. These instructions maybe present m 
the subject kits inavarietyofforms, one or moreofwhich may be present in thektt. 

One form in which these instructions may be present is as printed infonnatton on a 
suitablemedium or substrate, e.g.,apieceorpiecesofpaper on which theinfonnatton 

is printed, in the packaging of the kit, in a package insert, etc. Yet another means 
20 would be a computer readable medium, e.g., diskette, CD, etc., on which the 

information hasbeen recorded. Yet anothermeans that may be present isawebsite 

address which maybe used via the internet to access the information ataremoved site. 
Any convenient means may be present in the kits. 



25 



30 



TT,e foUowing examples a.^ offered by way of illustration and not by way of 
limitation. 

PYPPRTMENTAL 

VXAMPLE 1 

rnnstmctior " p^nnor Construct 

This example describes a donor construct, the pD3 vector, which contained two 
loxP sites, apolylinker. a chloramphenicol resistance gene (Cm"^) open reading frame 
lacking apromoter. a standard origin of replication (derived from pUC19) and an 
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ampicilhn resistance gene (Anip"^) with its associated promoter. If a gene of interest is 
contained within pD3, any number of plasmid expression constructs contammg tos 
gene of interest can be constructed rapidly (e.g.. within a single day). The expression 
constructs (the acceptor construct or the pAcceptor used in this example was 
pCMVmycloxP (described below)) contained a sequence-specific recombinase target 
site, a promoter capable of expressing a gene of interest, an antibiotic resistance gene 
other than chloramphenicol {eg., ampicillin), and a promoter positioned such that 
upon recombination of the pAcceptor with pD3, the promoter drove expression of the 

Cm'^ open reading frame from pD3. 

Using a site-specific recombinase, Cre, a fragment of the initial donor construct 
encoding the gene of interest and the Cm'^ open reading frame recombined into the 
pAcceptor construct at its loxP site, resulting in the production of a vector in which the 
fragment of pD3 having the Cm"^ open reading frame was placed under the control of 
the second promoter on pCMVmycloxP. The recombination of pD3 and 
1 5 pCMVmycloxP to fomi the final desired recombinant construct was selected for by the 
abihty of cells transformed with the constructs to grow in the presence of 
chloramphenicol. 

The plasmid backbone used to generate pD3 was the pUCl 9 plasmid. Thus, 
the origin of repUcation and the second selectable marker gene of pD3 were the pUC 
20 origin of replication and the pUC Ampicillin resistance gene, respectively. Hiis base 
vector fiirther was derived to generate pD3 as follows: 

1 pUC19 was digested with AatU and Sapl to remove the region containing the 
LacZgeneandpolylinker(nucleotides 2617-2686; 1-690). Into the remaining 
25 fragment were cloned two double-stranded oligonucleotides made by amiealing the 
following two pairs of single stranded oligonucleotides: 

LoxPl-up: S'-CGCGGCCGCATAACTTCGTATAGCATACATTATACG 

AAGTTATCAGTCGACG-3' (SEQ ID No. 1); 

LoxPl-down: 5'-AATTCGTCGACTGATAACTTCGTATAATGTATGC 

30 TATACGAAGTTATGCGGCCGCGACGT-3' (SEQ ID No. 2); 

LoxP2-up: 5'-AATTCGGATCCATAACTTCGTATAGCATACATTAT 

ACGAAGTTATGCGGCC-3' (SEQ ID No. 3); 

LoxP2-down: S'-AGCGGCCGCATAACTTCGTATAATGTATGCTATA 

CGAAGTTATGGATCCG-3' (SEQ ID No. 4). 
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The « pair of oligonucleoMes =.co4ed ov^hangs for AalU ^ Ecm and 
„,e seccndpair encoded ovahangs for ^.oWa»dS»p.,Th»..wopai.o,„hg^ 

„ere tas ligated a. U.eir conunon toRI overhang and were s„bs.<pen«y ab . .o 
c,™.di„to*e.«,na„d&,p.-d,ges.edpUC>9Dm.lnd,eprocesa.d,e^«p.s,=waa 

, ,o« ,„addWon.ofl,eres,ric,iansi,e.™en,io„od.,heLoxP,-up/do™ol.gon«l«,.de 
p^ralso em^«iaM ai.e (GCOGCCGC) (SEQ m 5). SMarly. *eLoxP2. 

P^ir also encoded a .o, I sire and a .o„H. .He (GGATCC) (SEQ ID No. 6). 
This first construct is called pDl . 

0 2 pDlwasdigestedwithfia.HIand£coRI.andaPCRfragmentencoding^ 

cWora.phenicolresistar.ce gene open readmgframeCCn,^)^ 
(„ucleotides:1932-ni5.con.plen,entofthevectorpProTet.E121.avail^^^^ 

CLONTECH) was inserted using an Ecom site and a BglU site engineered mto the 
following reverse and forward PGR primers, respectively: ^ , 

15 Cm^fwd: S'-ATGCTTGATACTAGATCTTTCAGGAGCTAAGGAAGC 

TA-3' (SEQ JD No. 7); 

J-ev: 5--ATGCTGAATrCrG0ATCCIQGTCATGACTAGTGCTTG0- 

3' (SEQ ID No. 8). 

TOs resnUed to «« placemen, of fte Cm« open reading frame adjacent to the 
20 5-e.doftesecond.oxPsi,eb«in«>ereve,se5..oroHen.ation. toaddinon^the 
„Hgi«d Ba.m site in pDl waa destroyed and a new S»»HI .ite waa created adjacent 
to the Earn site. This vector is caUed pD2. 

3 pD2 was cut with NCl and religated, so as to invert the orientation of the 

25 o.sse«eencodi„g«.el-P«i<«-'"'«C"""P— 

.,^iciUin.esist«,cesele«ablemarke,mthepUC.9bacUhone.-n.iscons.ructwas 

caU6dpD3. 

4 pD3 is digests with EcoM andfiamHI and aPCR fragment encodingthe Tl 
30 tenninationse<,uence(n«cleotides232043ofpPROTet.E121)isi-^^^ 

methods. The resultant plasmid is pD4. 

5 pD4 is restricted with EcoU and Banim and a PGR fragment encoding the 
SV40pol>^enyIationsequence is cloned into the vector. The resultant vector 
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6. PD5 is digested with BamE\ and Sail and an oligo encoding a multiple cloning 
site is cloned into the BamHI and SaR sites to generate the final basic donor construct. 

5 

FXAMPLE 2 

rnnstniction of pCM V-mvc-LoxP 

Acceptor constructs for the donor recombination system are generally 

10 expression vectors which have been modified by the insertion of a loxP or other 

sequence-specific recombinase target site and a prokaryotic promoter in a position 3' 
of the sequence-specific recombinase target site and oriented such as to direct 
transcription through the sequence-specific recombinase target site. It is also possible 
to ufiUze readthrough transcription from other promoters in the expression vector, 

15 provided that their orientation and distance fi-om the loxP site is such that they can 
drive expression of the donor partial-selectable marker gene upon recombinationn of 
the acceptor vector with the desired fi^gment of the donor vector. The presence of a 
loxP site on the acceptor constnict permitted the rapid subcloning or insertion of the 
gene interest contained within the pDonor vector to generate a final recombination 

20 construct capable of expressmg the gene of interest. The acceptor construct may 

encode a protein domain such as an affinity domain or sequence tag including, but not 
limited to. glutathione-S-transferase (GST), maltose binding protein (MBP). protein A, 
protein L, a polyhistidine tract, the c-Myc Tag, the HA tag, the Flag Tag, Green 
Flourescence protein, etc. A variety of commercially-available expression vectors 

25 encoding such affinity domains and tag sequences are known in the art. When the 

acceptor construct encodes an affinity domain, a fiision protein comprising the affinity 
domain and the protein of interest is generated when the proper pDonor Segment and 
the acceptor constructs are recombined. 

To generate fmal recombination constructs having the appropriate 
30 transcriptional fiisions, a sequence-specific recombinase target site was placed after 
downstream of) the start of transcription in the acceptor construct. In designing 
the oligonucleotide coinprising the sequence-specific recombinase target site, care was 
taken to avoid introducing a start codon (the sequence "ATG") which might 
inappropriately initiate translation. Also, when generating a final recombination 
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construct product having an appropriate translational fusion between the acceptor- 
encoded protein domain and the donor-encoded gene of interest, care was taken to 
place the loxP site in the correct reading frame such that an open reading frame was 
maintained through the sequence-specific recombinase target site on pAcceptor. and 
5 the reading frame in the sequence-specific recombinase site on pAcceptor was in- 
frame with the reading frame found in the first sequence-specific recombinase target 
site contained within the pDonor construct. In addition, the oligonucleotide 
comprising the sequence-specific recombinase target site on pAcceptor and the first 
sequence-specific recombinase target site contained within the donor were designed to 
10 avoid the introduction of in-frame stop codons. The gene of interest contained within 
the pDonor construct was cloned in a particular reading frame so as to facilitate the 
creation of the desired fusion protein. 

Methods for modification of one expression vector are provided below to 
illustrate the creation of suitable pAcceptor constructs. The general strategy involves 
15 the generation of a linker containing a sequence-specific recombinase target site by 
annealing two complementary oligonucleotides. The amiealed oligonucleotides form a 
linker having sticky ends which were compatible with ends generated by restriction 
enzymes whose sites are conveniently located in the parental expression vector (e.g., 
within the polyHnker of the parental expression vector). In addition, but not 
20 necessarily, a prokaryotic promoter was cloned downstream of the sequence-specific 
recombinase target site with it's 5' to 3' orientation such that it directed expression 
through the sequence-specific recombinase target site. 

pCMV-myc-LoxP is an example pAcceptor construct. It was generated from 
pCMV-Myc (available from CLONTECH) in the following way: 
25 1 . pCMV-Myc was digested with ^I and BgHl. 

2. A double-stranded oligonucleotide encoding an overhang at its 5' end 
compatible with Sjil; a LoxP site; and an overhang at its 3' end compatible with Bgffl 
was generated by annealling the following oligonucleotides together: 

LoxPMyc-up: 5'-AGATAACTTCGTATAGCATACATTATACGAAG 

30 TTATA-3' (SEQ ID No. 09); 

LoxPMyc-down: S'-GATCTATAACTTCGTATAATGTATGCTATACG 

AAGTTATCTCCA-3' (SEQ ID No. 10). 

Hiis oligonucleotide was then cloned into the digested pCMVMyc vector to 

generate pAccl. 
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3. The plasmid pAccl was then digested with BglH and Nhel into which a 
PCR fragment encoding the ampicilUn promoter from pUC19 (nucleotides: 2620- 
2500, complement) was cloned. This fragment was generated using appropriate 
primers encoding Bamm and Nhel restriction sites as follows: 
5 AmpProFwd: 5'-ATGCTGGATCCAATATrATrGAAGCATITATCA GG- 

3' (SEQIDNo. 11); 

AmpProRev: 5'-TCCATGCTGCTAGCACGTCAGGTGGCACTTTTCG-3' 

(SEQIDNo. 12). 

The resultant plasmid is pCMVMycLoxP. which is a basic Acceptor plasmid 
10 having a LoxP site and adjacent promoter to drive expression of the gene of interest in 
the same 5' to 3' orientation as the LoxP site and a second promoter (acceptor partial 
selectable marker gene), oriented in the reverse 5' to 3' direction as the LoxP site and 
placed adjacent to the 3' end of said LoxP site. 

A similar strategy to generate other types of acceptor vectors will be readily 
15 apparent to those skilled in the art. This strategy can be employed to generate any 
number of pAcceptor constructs. It is only necessary to design the ohgos and PCR 
primers with appropriate restriction sites to match those in the polylinker of the 
construct to be adapted. 

20 EXAMPLE 3 

Generation of 10 additional acceptor vectors 

10 additional acceptor vectors, as described in Figures 3A-3J have been made as 
25 foUows. The construction of these vectors was as follows: 

Each parental vector used to generate the various acceptors was cut with two 
restriction enzymes that cut within the MCS of the vector, as detailed below. Into 
these was inserted a PCR fragment of approx. 170 bp generated using various primers 
30 (described below) and pCMVMycLoxP, the acceptor molecule described above (see 
example 2 above) as a template. The primers are named either LoxP or AmpPro (to 
designate to which part of the template they are complementary) plus the name of the 
restriction enzyme present in the 5' end of the primer. These restriction sites match 
the ones cut in the MCS of the vector to be modified. The fragment generated in th,s 
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PCR reaction encodes the LoxP site and the ampiciUin promoter from the 
pCMVMycLoxP acceptor. 

List of the 10 vectors used and restriction sites and primers used in the construction of 
5 each. 

1. PGADT7: Cut with EcoRJ and BamHI insert PCR fragment made with primers 
LoxP-EcoRI and AmpPro (cut with enzymes EcoRI and BamHI) 



10 



, pGBKT7: Cut with EcoRI and BamHI insert PCR fragment made with primers 
LoxP-EcoRI and AmpPro (cut with enzymes EcoRI and BamHI) 

3. pIRESneo: Cut with EcoRI and BamHI insert PCR fragment made with primers 
LoxP-EcoRI and AmpPro (cut with enzymes EcoRI and BamHI) 
15 4. pEGFP-ClrCutwithHindfflandBamHIinsertPCRfragmentmadewith 
primers LoxP-Hindlll and AmpPro-BamHI (cut with enzymes HindHI and 
BamHI) 

5. pECFP-Cl: Cut with HindHI and BamHI insert PCR fragment made with 
primers LoxP-Hindffl and AmpPro-BamHI (cut with enzymes HindlU and 
20 BamHI) 

6 pEYFP-Cl: Cut with Hindffl and BamHI insert PCR fragment made with 

primers LoxP-Hindffl and AmpPro-BamHI (cut with enzymes Hindffl and BamHI 



25 



7. pTRE2:Cutwith SacII and BamHI insert PCR fragment made with primers 
LoxP-sacH and AmpPro-BamHI (cut with enzymes SacH and BamHI) 

8. pRevTRE: Cut with Hindffl and Clal insert PCR fragment made with primers 
LoxP-Hindffl and AmpPro-Clal (cut with enzymes Hindffl and Clal) 

30 9. pLNCXiCutwithHindfflandClalinsertPCRfragmentmadewithprimers 
LoxP-Hindffl and AmpPro-Clal (cut with enzymes Hindffl and Clal 
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10 PIRES2-EGFP: Cut with EcoRl and BamHl insert PGR fragment made with 
' primers LoxP-EcoRl and AmpPro-BamHI (cut with enzymes EcoRI and BamHI) 

Primers for amplification of insert, providing various restriction enzyme ends 
(underlined) to enable cloning into above vectors. 



1. LoxP-EcoRI 

Sc«=«= (5'-3'). GATGCTSiMnCATAACrrCCTATAGCATACATrAT (SEQ 
IDNO:13)a36mer 

10 MW-0: 1 1025 MW-N: 1 1587 TM: 66.91666 Extinction Coef: 399 Mass(ug) per OD: 
29.0401 

2. AmpPro-BHI 

Sequence {5'-3'}: AGTCTGGATCCACGTCAGGTGGCACTTTrCG (SEQ ID 

15 NO:14)a31mer 

MW-0: 9512 MW-N: 9994 TM: 73.40323 Extinction Coef: 320 Mass(ug) per OD: 

31.23125 

3. LoxP-HiBdIII 

20 Sequence {5'-3r. ATGCTAAGCUCGATAACTrCGTATAGCATACATTAT (SEQ 
IDNO:15)a37mer 

MW-0: 11314MW-N: 1 1892 TM: 67.85135 Extinction Coef: 406 Mass(ug) per OD: 
29.29064 

25 4. AmpPro-Clal 

Sequence {5'-3-}: AGTCTAICQATACGTCAGGTGGCACTITrCG (SEQ ID 

NO:l6)a31mer 

MW-0: 9511 MW-N: 9993 TM: 71.20968 Extinction Coef: 325 Mass(ug) per OD: 
30.74769 

30 

5. LoxP-Nhel 

Sequence {5'-3T. TCCATGCTGCTAGCATAACTrCGTATAGCATACATTAT 
(SEQIDNO:17)a38mer 
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MW-O: ,1579 MW-N: ,2173 TM: 69.631 58 Extinction Coef: 405 Mass(ug) per OD: 
30.05679 

6.LoxP-SacIl 

5 Scuence iS'-T): TAGTACTCCGCGGATAACnCGTATAGCATACATTAT 
(SEQIDNO:18)a37mer 

MW-O: 1 1315 MW-N: 1 ,893 TM: 69.68919 Extinction Coef: 401 Mass(ug) per OD: 
29.65835 

,„ •n«pim«wc.a.lmadea.dPAGEpurifiedb,o»«^l.rs.ppl«r(Kcy«oncI*s) 
and «s»^d=d in wa.^ .0 a concenTation of 100 pmoW m water. 
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ir.XAMPLE4 

J. i/.v.. P^.nn,hinat^»» T T.^n. the t^Donor reconiMonSM 

n,e pDonor recombination system permits in vitro recombination of two 
constructs. Figure 1 provides schematic showing the strategy employed for in ntro 
recombination. pDNR-1,2,3 represent typicalpDonor constructs which contams two 

loxP sites, a chloramphenicol resistance gene open readmg frame which lacks a 
promoter, an origin ofreplication and an ampicillin resistance marker. The desired 

Acceptor vector shown contains a loxP site, a prokaryotic promoter in opposite 
orientation to the loxP site to drive the chloramphenicol open reading frame of pDNR- 
U3 an ampicillinresistancegene,aeukaryotic or prokaryotic promoter or fuston tag 

to^it expression ofthegeneofinterestunder appropriate conditions, andapUC 

origin of rq)lication. 

To «*ic« g«.««io» of to expression veew ftom donor «,d aecq,.or, 
U^fonowingweremixedtogdheroniceinasand^deppe^iorfmicrocentninse 

ttbe- 0 5 M pCMVmycloxP (representog Ike Acceptor vector); 0.5 fg pD3 
(represe«i.gtheDonorvector);2,J lOx ere reaction bufferdOxOe reaction buffer 

co„t.im:50O,nMTns-Ha(pH7.5)and30On^N.Cl); lOtnMMgtt, . Hl20x 
BSA (20. BSA eonuuns 2 ntg/nrl BSA (NEB)); 25 Units Cre recontoase (Novagen); 
H2O to 20 |il total. 
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Once the reagents were mixed, the reaction was incubated for 15 mins at 37«>C. 
Following the reaction, the mixture was heated to 65»CforlO™ins to inactivate the 

ere enzyme. Finally, an aliquot of the reaction mix was transfonned to E. coli usmg 
standard methods (e.g., electroporation). and the transformed bacteria selected on LB 
plates containing 60 ^g/ml Chloramphenicol. 

Alternatively, the pDonor vector may be incubated with Cre alone under the 
conditions described above; followed by purification of the fragment bearing the gene 
of interest, e.g., by gel electrophoresis, and subsequent recombination of the punfied 
fragment into the pAcceptor vector, again according to the method above. 

RXAMPLE 5 

y.. T.XP Sit es to increase Expression of the Protein of Intere st 

Tl.e pDonor and pAcceptor constructs employed in the pDonor recombmation 
1 5 system of the pre«»t invention are designed such that construct recombination results 
in the introduction of a loxP site between the promoter and the gene of interest. LoxP 
sites consist of two 13 bp inverted repeats separated by an 8 bp spacer region. 
Transcripts of the gene of interest produced from a pDonor-pAcceptor recomb.natron 
construct comprisingaloxP site have two IBnucleotide perfect inverted repeats w^than 

20 the 5' untranslat^l region (UTO) and have the potential to form a stem-loop structure, 
m fact, this wiU occur in those cases where pAcceptor does not encode an affimty 
domain at the amino-terminus of the fusion protein. However, it is possible also to 
constructpDonor andpAcceptor constructs containingmutatedloxP sequences. 

Mutated loxP sequences which comprise point mutations that create mismatches 
between the two 13 bp inverted repeat sequences within the loxP sites and have 
nusmatches at differentpositions in the inverted repeats located witWnaioxP^ 

be used. The suitability of any pair of mutated lox sites for use in the pDonor 
recombination system may be tested by replacing the sequence-specific recombmase 
target sites in pDonor and PAcceptor withasite to be tested. The two modified vectors 

are thenrecombinedmW/ro as describedinExampleSand the recombination reaction 
nuxtureisusedtotransform^. c./«cdls. TT^e transformed cells are then plated on 
selective medium (e.g.. Cm plates) in order to determine the efficiency of 
„«x,mbination between the two mutated lox sites (Example 3). The efficiency of 
recombination between the two mutated lox sites is compared to the efficiency of 
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recombination between two wild-type loxP sues. It will be apparent to those skilled it, 
the art thatasimilar strategy can be employedforthemodificationofmsites when 

the FLP recombinase is employed for the recombination event, or other such 
recombinasesitesasmightbeused. TT.e frt site, like lox sites, contains two 13bp 
5 inverted repeats separated by an 8 bp spacer region. 

FYAMPLE6 
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AU^m^tivft cQnfo "^''rinr,s of pAcceptor and pDon or 

The above-described constructs may be altered in the structural orgamzataon of 
their t^spective components, however, both constructs must be altered such that 
following recombination, the donor-partial and acceptor-partial selectable markers 
comprise an intact, recombinant-fimctional selectable maricer. and additionally, the 
firstpromoterisoperablylinkedtothegeneorDNAsequencesofinterest. For 

example, the invention could be done in a sunilar fashion as described, except that the 
positions and orientations of the donor-partial selectable marker on the Donor 
construct and the acceptor-partial selectable marker on the Acceptor construct are 
switched. TTte final result of the recombination between the proper donor fragment (or 
first donor fragment) and the acceptor construct still generates a recombmant- 
ftnctionalselectionmarker. Likewise, vectors such that the selection marker 
comprises two fragments and forms a recombmant-fi.nctional selectable marker m the 
final product by reading throu^ the second sequence-specific recombinase target sUe 
are also included within this invention. 



ir.YAMPLE7 



n,„,ntirn -f-"'>T'- --^^°» constructsfoLludfa^^ 

UsingtheDonor recombination system, it is possible to transfer one or many 
30 genes into multiple acceptor expression vectors at substantially the same time. To 
demonstrate this, the luciferase gene cloned into the multiple cloning site of pDNR-1, 
so generating pDNR-luc. The luciferase gene was then transferred from pDNR-luc 
into transferred to 10 different acceptor vectors simultaneosly using the method 
described in EXAMPLE 4. To do this, each often individual acceptor vectors, as 
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detailed in Figs 3A to 3J, was placed in an eppendorf tube in reaction buffer as 
described in EXAMPLE 4. To each tube was then added 200 ng of pDNR-luc and 1 
unit of ere recon^binase. The reactions were incubated at 37 °C for 15 mins and then 
the Cre recombinase was inactivated by heating to 65 °C for 10 mins. Each reactton 

5 was then transfonned individually into a separate aliquot of electo-competent DH5- 
alpha E coii. These were allowed to grow for 1 hour in the absence of selection and 
then were plated out on selective agar plates contaimng 30 ug/ml chloramphenicol and 
r/o w/v sucrose. The following day. 3 colonies from each transformation were picked 
and gn>wn-up for mini prep restriction digest analysis to determine if the desired 

10 recombinant had been made. Of the 30 clones analyzed in total (3 for each construct). 
27 were correct, thus demonstrating that it is possible using the subject methods to 
readily generate multiple expression constructs - in this example 10 constructs - in a 
single day. 



15 



FXAMPLE 8 

r^„„,,,,,,„ fnr HEK ?9^ cHls transfected usinp Creator^ and 

20 gtardard vectors. 

To compare the expression level achievable with Creator vectors (i.e., donor 
and acceptor vectors of the subject methods) to that generated using standard vectors, 
HEK 293 cells were transfected using the Calcium Phosphate method with either the 
pLP-EGFP-luc expression vector generated as part of example 7 above , or the 

25 comparablevectormadeusingtraditionalcloningmethods-pEGFP-Luc(available 

from CLONTECH). 24 hours after transfection. the level of fluorescence and the % of 
cells transfected was determined by both fluorescence microscopy and by FACS 
analysis. The resuU showed that while there is some reduction in expression 
associated with the Creator vectors, it is not a significant hindrance to adequate 
30 expression. 

EXAMPLE 9 

^r^,y. interartion by ye «st two-hybrid analysis 
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were i 
werel 



The interaction of myc and max proteins was tested by yeast two-hybrid 
interaction. To do this, relevant coding fragments of the human myc and max genes 
first cloned in to pDNR-1 by standard restriction cloning methods. These genes 
; then each transferred by use of the subject methods, as described in example 4 
5 above, to both the pLP-GBKT7 (GAL4 DNA binding domain - bait vector) and the 
pLP-GADT7 (GAL4 activation domain - prey vector). AH109 yeast cells were then 
co-transfcimed with either pLP-GADT7 and pLP-GBKT7 alone, or with the same two 
expression vectors, but containing either myc or max. The yeast were then grown on 
selective medium lacking leucine and tryptophan in order to select for growth of yeast 
10 containing both constructs. The strength of the interaction between the protein 
expressed in the bait and the prey constructs was then determined using an alpha- 
galactosidase quantitative assay and normalized for culture density, as described in the 
MATCHMAKER system 111 user manual (available from CLONTECH). In this way, 
it was shown that myc and max interact well, but the homodimers do not. 



ff.y AMPLE 10 



^„A,.riy.U ..v pressinn of Ip riferase in HeLa cells using pLP-TRE-Luc 

The expression construct pLP-TRE-Luc generated by recombination of pLP- 
20 TRE and pDNR-Luc as described in Example 7 above, according to the method in 
example 4 above, was transfected into HeLa Tet-Off cells (available from 
CLONTECH) using the geneporter Upofection kit (available from Gene Therapy 
Systems). The cells were then cultured for 48 hrs in the presence of varying 
concentrations of doxycycline. The cells were then harvested and assayed for 
luciferase activity. The luciferase activity varied over several orders of magnitude 
from high level expression to background levels, dependent on the level of 
Doxycycline present in the growth medium. 
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KXAMPLE 11 



Hi ph level lucifera-^^ indnction with p T P-TRE-luc and pT P-FevTre-Luc compared to 
pTRE-luc 

As described in example 8 above, the level of expression from Creator vectors can be 
somewhat reduced when directly compared to comparable expression vectors made 
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using convent.onalcloningn.ethods.lt should be noted in this exan^^^^ 

and n^aximanevelsofexpressionarereduced. It is thought likelythatthisreduc^^^ 

due to inhibition of RNA translation due to hairpins caused by the palindromtc lex 
sites Asshownabove.thisreductionse«nstohavenosignificantlydetrin.entaleffect 
on the ft^ctionalnyofanyofthe expression vectors tested, in this current exarnplewe 

further show that this reduction in expression may actually be beneficial in the case of 

inducible expression. This is because the decrease in expression caused by the lox 

sites seems to more greatly affect low-level expression than it does maximal 

expression. For this reason, when fold induction of tet inducible vectors (either 

, plasmid-based or retro viral) is compared between standard vectors and creator vectors 

the fold induction seen is much greater in the case of the creator vectors. To 
demonstrate this.HeLatet-off cells were transiently transfectedwithpTRE2-l.co^^ 

PLP-TOE2-1UC. or stabily infected with pRevlM-luc. Cells were then grown for 48 
hrs in the presence or absence of lug/ml doxycycline and then assayed for luciferase 
5 activity. Both pLP-TRE2-luc and pLP-RevTRE-Luc were observed to show greater 
fold induction than pTRE2-Luc. 
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EXAMPLE 12 

Construction of other accq)tor vectors 

All of the following acceptor vectors are made simply by taking the parental 
vector and usingPCR to insertasequenceencodingtheloxP site and the ampinlhn 

promoter, into the MCS of the vector. Note that this sequence is present m all of the 
10 acceptor vectors described above and can be obtained from them by PCR. 



1. pLP-Shuttle. an acceptor vector for transferring genes of intercut into an adenov^al 
vector,ismadeby inserting the above sequence into the Nhel site and Kpnl site ofthe 

30 pShutUe vector (available from CLONTECH). TOs vector could itself be used, 
without a gene of interest to then transfer the loxP site and ampiciUin promoter to 
adenoviralDNA,e.g..Adeno-XDNA (available from CLONTECH). so as to creator 

a adenovirus acceptor vector. 
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2- pU..BacPAK9,»accep.orvec.orfor™sfento8ga«softa.««.in.<.» 
Ju,«vWv=c,o,,is..debyinse«ing*= above se,„e„«ein.ofl,.EcoWs,.e«,d 
B;nsUeof*epB3cPAK,vec.o.<avan*^™CLONTBCH),^.v»U,,co^ 

p™„.er ,0 bac„,ov.„ D^, e... Baculo G«M DKA ,.vai.ab,e ,^n^l 
so as to ciealor a baculoviius acceptor vector. 

3. pLP-CMV.Myc, an acceptor vector providing co„atit,«ive r^im «p.esdo. 
,„ ftolth.CMVp,on,oterofn„.epit^e.taggedgcneofin,«.sUsmad.byi«««ng 

rabovese,uLtatot.eSMaiteandB,Usi.eoftbepCMV-Mycvec.o,(av»W,.e 

fiomCLONTECH) 

4. PIP.CMV-HA, an acceptorvector providing constitutivettta^-iap^pr-i- 
15 ftlth.CMVpro,no.erofHAepitope-taggedgepeofi.teres..ismad.b,n«e,« 

Lbovese,uencein.otbeS«site»ndBgm»iteof.hepCMV-HAve«or(.v.1ab,e 

SomCLONTECH) 

5- pLP-PR0Te..6x(HN), an acceptor vector providing T..-i.«luciblebaOeriaI 
20 ^onofa6x(HN>taggedgeneofintcrest,isn,adebyinse.ting.hc.b.v= 
seo.encei„to.heHinDniandaalsite.ofpPROT«.E133(availab..ftom 
^raSi„cet«svec.orha.cb,oran,phenico„esis.ance,i.sho..d.dd,t,ona», 

be nodiSed by changing chloramphenicol to an-picfflin r«Wanc. 



is evid«.t ftom the above resulta and discussion .ha. the subjec inve«io. 
,^aes»..fHci».~.bodu..ansferanuc.eicacid6on,ai«v««,oas«o.rf 

Lor, »h«. the object m«hods do no, ™ploy digestion and ligaHon pn,toe.l. 
rva:.g«P™videdby.h.s„bjecttaven.oninclode:.heabi,i.y»«.»f.«c.-a 

,„ nncleic acid ofinteres. »o.. a shrgle donor «o a variety of ^"P^^^, 
v=c.»ats.bsUn«a..y*.«»cta.andinal=nownorten.ahonandreadn,g^ 

,he«,y».eadilyiden.i.ys„ccessfclclo^s;.heabi.i.y.otransfer,„any.hffe^ 

geneswoneormore expression ve^orssimnltaneously; no longer needrng^ 

Lpence.heiu.«io.sof.he«nsfer,edfi.^=n.a.d*eexpressionve«ororto 
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resequence the gene transferred and the like. As such, the subject invention represents 
a significant contribution to the art. 

All publications and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. The citation of 
any publication is for its disclosure prior to the filing date and should not be construed 
as an admission that the present invention is not entitled to antedate such publication 
by virtue of prior invention. 



10 



Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity of understanding, it is readily apparent 
to those of ordinary skill in the art in light of the teachings of this invention that certain 
changes and modifications may be made thereto without departing from the spint or 
1 5 scope of the appended claims. 
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>VHATTS CT. AIMED IS: 

1. A method of producing an expression vector, said method comprising: 

combining a donor vector and an acceptor vector with a recombinase under 
conditions sufficient for site-specific recombination to occur, wherein said donor and 
acceptor vectors comprise components sufficient to give rise to an expression vector 
comprising first and second recombinase recognition sites oriented in the same 
direction, wherein said first and second recombinase recognition sites are able to 
recombine with each other; and wherein said donor and acceptor vectors are fiirther 
characterized in that one of said donor and acceptor vectors comprises a single 
recombinase recognition site and the other of said donor and acceptor vectors 
comprises two recombinase recognition sites; 
to produce said expression vector. 

15 2. nie method according to Claim l. wherein said donor vector comprises two 
recombinase recognition sites, wherein said recombinase recognition sites are able to 
recombine with each other. 

3. TTie method according to Claim 2. wherein said two recombinase sites flank a 
20 at least one restriction endonuclease site. 

4. The method according to Claim 2. wherein said acceptor vector comprises a 
single recombinase recognition site, 

25 5. -fte method according to Claim 4. wherein said single recombinase recognition 
site is located on said acceptor vector between two promoters oriented in different 
directions. 

6. Themethodaccoidingtoaaim l, wherein said donor vector comprises a 
30 single recombinase recognition site and at least one restriction endonuclease site. 

7. The method according to Claim 1 . wherein said acceptor vector comprises two 
recombinase recognition sites. 
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8. The method according to Claim 1 , wherein said expression vector produced by 
said method further comprises an expression cassette comprising a coding sequence 
and a promoter, wherein said promoter and coding sequence flank said first 
recombinase recognition site. 

^ 9. A method of producing an expression vector comprising an expression cassette 
for a protein of interest, said method comprising: 

combining a donor vector and an acceptor vector with a recombinase under 
conditions sufficient for site-specific recombination to occur, wherein: 
10 (a) said donor vector comprises two recombinase recognition sites onented 

in the same direction and flanking a coding sequence for said gene of interest and a 
coding sequence for a selectable marker; and 

(b) said acceptor vector comprises a single recombinase recognition site 
flanked by first and second promoters oriented in opposite directions; 

whereby an expression vector comprising first and second recombinase 
recognition sites, wherein said first recombinase recognition site is present inside said 
expression cassette and flanked by said first promoter and said coding sequence for 
said gene of interest and said second recombinase recognition site is present in a 
functional selectable marker and is flanked by said coding sequence for said selectable 
20 marker and said second promoter; 

to produce said expression vector comprising an expression cassette for said 

protein. 

10. -me method according to Claim 9, wherein said first and second recombinase 
25 recognition sites are oriented in the same direction on said expression vector. 

11. The method according to Claim 9, wherein said first and second promoters are 
oriented in opposite directions on said expression vector. 

30 12. Tl,emethodaccordingtoClaim9,whereinsaidrecombinaseisaCre 
recombinase. 

13. The method according to Claim 9, wherein said recombinase recognition sites 
are lox sites. 
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14. The method according to Claim 9, wherein said expression vector is aplasmid. 

15. The method according to Claim 9. wherein said donor and acceptor vectors are 

5 plasmids. 

16 A donor vector comprising: 

first and second recombinase recognition sites flanking at least one restrictton 
endonuc.easesiteandaportionofaselectablemarker.whereinsaid first andsecond 

10 recombinase recognition sites are able to recombine with each other. 

17. The donor vector according to Claim 16, wherein said first and second 
recombinase recognition sites are oriented in the same direction. 

15 18. The donor vector according to Claim 16. wherein said portion of said 
selectable marker is a coding sequence. 

19. The donor vector according to Claim 1 6, wherein said recombinase recognition 
sites are lox sites. 

20. ThedonorvectoraccordingtoClaiml6.whereinsaidvectorisaplasmid. 

21 An acceptor vector comprising: 

a single recombinase recognition site located between a first promoter and a 

25 portion of a selectable marker, whereir. said first promoter is not operably Imked to a 
coding sequence. 

22 nie acceptor vector according to Claim 21, wherein said portion of a selectable 
n^arkerisasecondpromoter, wherein said secondpromoter is not operably linkedtoa 

30 coding sequence and is oriented in the opposite direction of said first promoter. 

23. The acceptor vector according to Claim 21, wherein said acceptor vector 
further comprises an origin of replication. 
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24. The acceptor vector according to Claim 21, wherein said recombinase 
recognition site is a lox site. 

25. The acceptor vector according to Claim 21. wherein said acceptor vector is a 
5 plasmid,cosmid,bac,yac or virus. 

26. An expression vector comprising: 

(a) first and second recombinase recognition sites oriented in the same 
direction and flanking at least one restriction endonuclease site; and 
10 (b) a fimctional marker divided into two sub-parts flanking said second 

recombinase recognition site. 

27. The expression vector according to Claim 26, wherein said expression vector 
comprises a fimctional expression cassette for a protein of interest. 

^^28 -me expression vector according to Claim 26. wherein said expression cassette 
comprises said first recombinase recognition site flanked by a first promoter and a 
protein of interest coding sequence. 

20 29 The expression vector according to Claim 26, wherein said fimctional maricer 
comprises said second recombinase recognition site flanked by a second promoter and 
a selectable marker coding sequence. 

30. nie expression vector according to Claim 26. wherein said first and second 
25 recombinase recognition sites are lox sites. 

31. The expression vector according to Claim 26, wherein said vector is a plasmid, 
cosmid, bac, yac or virus. 

30 32. A system for use in preparing an expression vector, said system comprising: 
(a) a donor vector comprising first and second recombinase recognition 
sites flanking at least one restriction endonuclease site and a first 
portion of a selectable maricer; and 
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(b) an acceptor vector comprising a single recombinase recognition site 
located between a first promoter and a second portion of a selectable 
marker, wherein said first promoter is not operably linked to a coding 
sequence. 

5 

33. The system 
marker is a second promoter. 



, according to Claim 32. wherein said second portion of a selectable 



34. The system according to Claim 33. wherein said first and second promoters are 
1 0 oriented in opposite directions on said acceptor vector. 

35. The system according to Claim 32, wherein said system fiirther comprises a 
recombinase. 

15 36. The system according to Claim 35, wherein said recombinase is a Cre 
recombinase. 

37. The system according to Claim 32, wherein said recombinase recognition sttes 
of said donor and acceptor vectors are lox sites. 



20 



38. 



n^e system according to Claim 32. wherein said donor and acceptor vectors are 



39. The system according to Claim 32. wherein said system further comprises a 
25 host cell. 

40 A kit for use in a producing an expression vector, said kit comprising: 

(a) a donor vector comprising first and second recombinase recognition 
sites flanking at least one restriction endonuclease site and a first 

30 portion of a selectable marker; and 

(b) a recombinase that recognizes said first and second recombinase 
recognition sites. 
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^grt-ion *es of said do,K,r v=c,o, are oriented i. s»n. *cc„o». 

«. Tl,eB.a^rding,oClata40.whe,ei„sai<.do«vec,orc<»npriscsam..Up.e 

5 cloning site. 

4, Tteldt.ceordi„gtoClai™40,where,„saiaf,..portonof«iaselec«le 
L„pr.««on-dono,vecu,risacod,nsse,»e„ceforase,.«a..e™Hce,. 

,„ 44. Th.«.aeco*.u,aain,40.»herei„-d,eeo,Bbi„a.isaaercco*i..«. 

45. Tl,e>d.»xoriing.oC.a™40.wherei„.^dWandseeond,ecombina« 
recognition sites are lox sites. 
,5 46. T„. Id. «xordtagU. Claim 40, whereta said do„o,vee.or is. placid. 

4, Tte aceonling » claim 40, wherein said Id. talher comprises an accep.o, 

^co™pnsing.si.g.ereeoMHnase.eo,„i.onsi.e,oea,edU™ee.a^ 

p^.era„d.aeco.dpo«ionofaselec«blen»*.r,whe,^sa,dfi.^p.H,o..r.s 

20 not operably linked .0 a coding sequence. 

48. Theli.acco*g.oaain.46,wh«einsaidsecondportionofase>ec«blc 

marker is a. second promoter. 
„ 49. neM.according»Cl.in>40,whe«i«saidld.f«e,coMpri«.'-««- 

50 Ueld.«co*|«.Claim40.whereins.idlci.<«.-omprisess«b»a.. 
^gir^onsforpracticinga-mcdiodofaain, 1 and/or means for 

obtaining said instructions. 

5, Anucleicacidiibraryclonedintoapluralityofvectorsselectedfrorn*^ 
consisting of donor and acceptor vectors. 
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52. n.e nucleic acid library according to Claim 51 , wherein said library is cloned 
into a plurality of donor vectors. 

53. The nucleic acid library according to Claim 51, wherein said Ubrary is a 
genomic library. 

54. Tl,e nucleic acid library according to Claim 5 1 . wherein said library is a cDNA 
library. 
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Figure 1B 



pDNR-1 (readiog frame 1): 

Ah 

^CA GTC 6 AC GHAO: 6 G A CAT^G MG^^ 

"imri^ w»i f«Ri BBifMi 



95 STOP* 
A AGC ITT aA GA CCATTCGTHOG C GCG C6G GCC CAGtAfiCTAAGfGM 



Apal 



pDNR-2(reaiiiBgframe2): 

45 

WCA CTC G AC T GGTACCAG ACA TAT GCCCG^ 



9* 

^Cn TPT aR&nTATTCGTTTBGCGCGCGCATGCAGTAGSTAAGTGA 



pDNR*3 ireadisg frame 3): 

45 

^rCACTCGACTOG GTACCGAGCATATGCM^AT^^ 



94 



AAGCTTATCTAGA CAn CGT TTG GCGCGC MiLCM AGT ABGTAA 
H'mAm XbMl tell 
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Figure 2A 
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Figure 2B 
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Figure 2C 
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Figure 2D 
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Figure 3B 




4 c-Myc epitope tag 
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Figure 3D 
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Figure 3F 
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Figure 3G 
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Figure 3H 
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SEQUENCE LISTING 

<110> Farmer, Andrew 

Recombinase-Based Methods for Producing 
^'sxpressiS V^^^^ and Compositions for Use in Practxcxng 

the Same 
<130> CLON-033WO 

<150> 09/356,011 
<151> 1999-07-14 

<160> 18 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 52 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide 

cgcggccgca taacttcgta tagcatacat tatacgaagt tatcagtcga eg 

<210> 2 
<211> 60 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> oligonucleotide 

llTc^C. ctgataactt cgtataatgt atgctatacg aagttatgcg gccgcgacgt 

<210> 3 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide 

alSJcggatc cataacttcg tatagcatac attatacgaa gttatgcggc c 51 

<210> 4 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide 

agcggcJgca taacttcgta taatgtatgc tatacgaagt tatggatccg 50 

<210> 5 
<211> 8 



60 



-1- 
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<212> DNA 

<213> artifical sequence 

<400> 5 
gcggccgc 

<210> 6 
<211> 6 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> restriciton site 

<400> 6 
ggatcc 

<210> 7 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> per primer 



iJgcJtgata ctagatcttt caggagctaa ggaagcta 

<210> 8 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> per primer 



atgctgaatt ctggatcctg gtcatgacta gtgcttgg 

<210> 9 
<211> 37 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> oligonucleotide 



agataacttc gtatagcata cattatacga agttata 

<210> 10 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide 



<400> 10 



-2- 
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gatctataac ttcgtataat gtatgctata cgaagttatc tcca 



<210> 11 
<211> 33 
<212> DNA 

<213> prtificial Sequence 
<220> 

<223> primer 



<400> 11 

atgctggatc caatattatt gaagcattta tea 

<210> 12 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



<400> 12 

tccatgctgc tagcacgtca ggtggcactt ttcg 

<210> 13 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



<400> 13 

gatgctgaat tcataacttc gtatagcata cattat 

<210> 14 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



<400> 14 

agtctggatc cacgtcaggt ggcacttttc g 

<210> 15 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



<400> 15 ^ 4-*. 

atgctaagct tcgataactt cgtatagcat acattat 



<210> 16 
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<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



<400> 16 

agtctatcga tacgtcaggt ggcacttttc g 

<210> 17 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



<400> 17 ^ *.4. 4- 

tccatgctgc tagcataact tcgtatagca tacattat 

<210> 18 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



tagtactccg cggataactt cgtatagcat acattat 
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